Graduate Texts
in Mathematics



An Introduction to 
Curvature 



Springer 





Graduate Texts in Mathematics 


176 


Editorial Board 
S. Axler F.W. Gehring P.R. Halmos 


Springer 

New York 

Berlin 

Heidelberg 

Barcelona 

Budapest 

Hong Kong 

London 

Milan 

Paris 

Santa Clara 

Singapore 

Tokyo 





Graduate Texts in Mathematics 


1 Takeuti/Zaring. Introduction to
Axiomatic Set Theory. 2nd ed. 

2 Oxtoby. Measure and Category. 2nd ed.

3 Schaefer. Topological Vector Spaces. 

4 Hilton/Stammbach. A Course in 
Homological Algebra. 2nd ed. 

5 Mac Lane. Categories for the Working 
Mathematician. 

6 Hughes/Piper. Projective Planes. 

7 Serre. A Course in Arithmetic.

8 Takeuti/Zaring. Axiomatic Set Theory. 

9 Humphreys. Introduction to Lie Algebras 
and Representation Theory. 

10 Cohen. A Course in Simple Homotopy 
Theory. 

11 Conway. Functions of One Complex 
Variable I. 2nd ed. 

12 Beals. Advanced Mathematical Analysis. 

13 Anderson/Fuller. Rings and Categories 
of Modules. 2nd ed. 

14 Golubitsky/Guillemin. Stable Mappings 
and Their Singularities. 

15 Berberian. Lectures in Functional
Analysis and Operator Theory.

16 Winter. The Structure of Fields. 

17 Rosenblatt. Random Processes. 2nd ed. 

18 Halmos. Measure Theory. 

19 Halmos. A Hilbert Space Problem Book. 
2nd ed. 

20 Husemoller. Fibre Bundles. 3rd ed. 

21 Humphreys. Linear Algebraic Groups. 

22 Barnes/Mack. An Algebraic Introduction 
to Mathematical Logic. 

23 Greub. Linear Algebra. 4th ed. 

24 Holmes. Geometric Functional Analysis 
and Its Applications. 

25 Hewitt/Stromberg. Real and Abstract 
Analysis. 

26 Manes. Algebraic Theories. 

27 Kelley. General Topology. 

28 Zariski/Samuel. Commutative Algebra. 
Vol.I. 

29 Zariski/Samuel. Commutative Algebra.
Vol.II.

30 Jacobson. Lectures in Abstract Algebra I.
Basic Concepts.

31 Jacobson. Lectures in Abstract Algebra
II. Linear Algebra.

32 Jacobson. Lectures in Abstract Algebra
III. Theory of Fields and Galois Theory.


33 Hirsch. Differential Topology. 

34 Spitzer. Principles of Random Walk.
2nd ed.

35 Wermer. Banach Algebras and Several 
Complex Variables. 2nd ed. 

36 Kelley/Namioka et al. Linear 
Topological Spaces. 

37 Monk. Mathematical Logic. 

38 Grauert/Fritzsche. Several Complex 
Variables. 

39 Arveson. An Invitation to C*-Algebras. 

40 Kemeny/Snell/Knapp. Denumerable
Markov Chains. 2nd ed. 

41 Apostol. Modular Functions and 
Dirichlet Series in Number Theory.
2nd ed.

42 Serre. Linear Representations of Finite 
Groups. 

43 Gillman/Jerison. Rings of Continuous 
Functions. 

44 Kendig. Elementary Algebraic Geometry. 

45 Loeve. Probability Theory I. 4th ed. 

46 Loeve. Probability Theory II. 4th ed. 

47 Moise. Geometric Topology in 
Dimensions 2 and 3. 

48 Sachs/Wu. General Relativity for 
Mathematicians. 

49 Gruenberg/Weir. Linear Geometry.
2nd ed.

50 Edwards. Fermat’s Last Theorem. 

51 Klingenberg. A Course in Differential
Geometry. 

52 Hartshorne. Algebraic Geometry. 

53 Manin. A Course in Mathematical Logic.

54 Graver/Watkins. Combinatorics with 
Emphasis on the Theory of Graphs. 

55 Brown/Pearcy. Introduction to Operator
Theory I: Elements of Functional
Analysis.

56 Massey. Algebraic Topology: An 
Introduction. 

57 Crowell/Fox. Introduction to Knot 
Theory. 

58 Koblitz. p-adic Numbers, p-adic
Analysis, and Zeta-Functions. 2nd ed. 

59 Lang. Cyclotomic Fields. 

60 Arnold. Mathematical Methods in 
Classical Mechanics. 2nd ed. 

continued after index 





John M. Lee 


Riemannian Manifolds 

An Introduction to Curvature 


With 88 Illustrations 



Springer 





John M. Lee 

Department of Mathematics 
University of Washington 
Seattle, WA 98195-4350 
USA 


Editorial Board 
S. Axler 
Department of 
Mathematics 

Michigan State University 
East Lansing, MI 48824 
USA 


F.W. Gehring 
Department of 
Mathematics 
University of Michigan 
Ann Arbor, MI 48109 
USA 


P.R. Halmos 
Department of 
Mathematics 
Santa Clara University 
Santa Clara, CA 95053 
USA 


Mathematics Subject Classification (1991): 53-01, 53C20 


Library of Congress Cataloging-in-Publication Data 
Lee, John M., 1950- 

Riemannian manifolds : an introduction to curvature / John M. Lee.

p. cm. — (Graduate texts in mathematics ; 176) 

Includes index. 

ISBN 0-387-98271-X (hardcover : alk. paper) 

1. Riemannian manifolds. I. Title. II. Series.

QA649.L397 1997 

516.3'73—dc21 97-14537 


© 1997 Springer-Verlag New York, Inc. 

All rights reserved. This work may not be translated or copied in whole or in part without the written 
permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 
10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection
with any form of information storage and retrieval, electronic adaptation, computer software,
or by similar or dissimilar methodology now known or hereafter developed is forbidden. 

The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the 
former are not especially identified, is not to be taken as a sign that such names, as understood by the 
Trade Marks and Merchandise Marks Act, may accordingly be used freely by anyone. 


ISBN 0-387-98271-X Springer-Verlag New York Berlin Heidelberg SPIN 10630043 (hardcover) 
ISBN 0-387-98322-8 Springer-Verlag New York Berlin Heidelberg SPIN 10637299 (softcover) 




Preface 


This book is designed as a textbook for a one-quarter or one-semester graduate
course on Riemannian geometry, for students who are familiar with
topological and differentiable manifolds. It focuses on developing an intimate
acquaintance with the geometric meaning of curvature. In so doing, it
introduces and demonstrates the uses of all the main technical tools needed 
for a careful study of Riemannian manifolds. 

I have selected a set of topics that can reasonably be covered in ten to 
fifteen weeks, instead of making any attempt to provide an encyclopedic 
treatment of the subject. The book begins with a careful treatment of the 
machinery of metrics, connections, and geodesics, without which one cannot 
claim to be doing Riemannian geometry. It then introduces the Riemann 
curvature tensor, and quickly moves on to submanifold theory in order to 
give the curvature tensor a concrete quantitative interpretation. From then 
on, all efforts are bent toward proving the four most fundamental theorems 
relating curvature and topology: the Gauss-Bonnet theorem (expressing 
the total curvature of a surface in terms of its topological type), the Cartan- 
Hadamard theorem (restricting the topology of manifolds of nonpositive 
curvature), Bonnet’s theorem (giving analogous restrictions on manifolds
of strictly positive curvature), and a special case of the Cartan-Ambrose- 
Hicks theorem (characterizing manifolds of constant curvature). 

Many other results and techniques might reasonably claim a place in an 
introductory Riemannian geometry course, but could not be included due 
to time constraints. In particular, I do not treat the Rauch comparison theorem,
the Morse index theorem, Toponogov’s theorem, or their important
applications such as the sphere theorem, except to mention some of them
in passing; and I do not touch on the Laplace-Beltrami operator or Hodge
theory, or indeed any of the multitude of deep and exciting applications 
of partial differential equations to Riemannian geometry. These important 
topics are for other, more advanced courses. 

The libraries already contain a wealth of superb reference books on Riemannian
geometry, which the interested reader can consult for a deeper
treatment of the topics introduced here, or can use to explore the more
esoteric aspects of the subject. Some of my favorites are the elegant introduction
to comparison theory by Jeff Cheeger and David Ebin [CE75]
(which has sadly been out of print for a number of years); Manfredo do
Carmo’s much more leisurely treatment of the same material and more
[dC92]; Barrett O’Neill’s beautifully integrated introduction to pseudo-Riemannian
and Riemannian geometry [O’N83]; Isaac Chavel’s masterful
recent introductory text [Cha93], which starts with the foundations of the
subject and quickly takes the reader deep into research territory; Michael
Spivak’s classic tome [Spi79], which can be used as a textbook if plenty of
time is available, or can provide enjoyable bedtime reading; and, of course,
the “Encyclopaedia Britannica” of differential geometry books, Foundations
of Differential Geometry by Kobayashi and Nomizu [KN63]. At the
other end of the spectrum, Frank Morgan’s delightful little book [Mor93]
touches on most of the important ideas in an intuitive and informal way
with lots of pictures; I enthusiastically recommend it as a prelude to this
book.

It is not my purpose to replace any of these. Instead, it is my hope 
that this book will fill a niche in the literature by presenting a selective 
introduction to the main ideas of the subject in an easily accessible way. 
The selection is small enough to fit into a single course, but broad enough, 
I hope, to provide any novice with a firm foundation from which to pursue 
research or develop applications in Riemannian geometry and other fields 
that use its tools. 

This book is written under the assumption that the student already
knows the fundamentals of the theory of topological and differential manifolds,
as treated, for example, in [Mas67, chapters 1-5] and [Boo86, chapters
1-6]. In particular, the student should be conversant with the fundamental
group, covering spaces, the classification of compact surfaces, topological
and smooth manifolds, immersions and submersions, vector fields and flows,
Lie brackets and Lie derivatives, the Frobenius theorem, tensors, differential
forms, Stokes’s theorem, and elementary properties of Lie groups. On
the other hand, I do not assume any previous acquaintance with Riemannian
metrics, or even with the classical theory of curves and surfaces in $\mathbf{R}^3$.
(In this subject, anything proved before 1950 can be considered “classical.”)
Although at one time it might have been reasonable to expect most
mathematics students to have studied surface theory as undergraduates,
few current North American undergraduate math majors see any differential
geometry. Thus the fundamentals of the geometry of surfaces, including
a proof of the Gauss-Bonnet theorem, are worked out from scratch here.

The book begins with a nonrigorous overview of the subject in Chapter
1, designed to introduce some of the intuitions underlying the notion of
curvature and to link them with elementary geometric ideas the student
has seen before. This is followed in Chapter 2 by a brief review of some
background material on tensors, manifolds, and vector bundles, included
because these are the basic tools used throughout the book and because
often they are not covered in quite enough detail in elementary courses
on manifolds. Chapter 3 begins the course proper, with definitions of Riemannian
metrics and some of their attendant flora and fauna. The end of
the chapter describes the constant curvature “model spaces” of Riemannian
geometry, with a great deal of detailed computation. These models form a
sort of leitmotif throughout the text, and serve as illustrations and testbeds
for the abstract theory as it is developed. Other important classes of examples
are developed in the problems at the ends of the chapters, particularly
invariant metrics on Lie groups and Riemannian submersions.

Chapter 4 introduces connections. In order to isolate the important properties
of connections that are independent of the metric, as well as to lay the
groundwork for their further study in such arenas as the Chern-Weil theory
of characteristic classes and the Donaldson and Seiberg-Witten theories of
gauge fields, connections are defined first on arbitrary vector bundles. This
has the further advantage of making it easy to define the induced connections
on tensor bundles. Chapter 5 investigates connections in the context
of Riemannian manifolds, developing the Riemannian connection, its geodesics,
the exponential map, and normal coordinates. Chapter 6 continues
the study of geodesics, focusing on their distance-minimizing properties.
First, some elementary ideas from the calculus of variations are introduced
to prove that every distance-minimizing curve is a geodesic. Then the Gauss
lemma is used to prove the (partial) converse: that every geodesic is locally
minimizing. Because the Gauss lemma also gives an easy proof that
minimizing curves are geodesics, the calculus-of-variations methods are not
strictly necessary at this point; they are included to facilitate their use later
in comparison theorems.

Chapter 7 unveils the first fully general definition of curvature. The curvature
tensor is motivated initially by the question of whether all Riemannian
metrics are locally equivalent, and by the failure of parallel translation
to be path-independent as an obstruction to local equivalence. This leads
naturally to a qualitative interpretation of curvature as the obstruction to
flatness (local equivalence to Euclidean space). Chapter 8 departs somewhat
from the traditional order of presentation, by investigating submanifold
theory immediately after introducing the curvature tensor, so as to
define sectional curvatures and give the curvature a more quantitative geometric
interpretation.

The last three chapters are devoted to the most important elementary
global theorems relating geometry to topology. Chapter 9 gives a simple
moving-frames proof of the Gauss-Bonnet theorem, complete with a careful
treatment of Hopf’s rotation angle theorem (the Umlaufsatz). Chapter
10 is largely of a technical nature, covering Jacobi fields, conjugate points,
the second variation formula, and the index form for later use in comparison
theorems. Finally in Chapter 11 comes the denouement, with proofs of
some of the “big” global theorems illustrating the ways in which curvature
and topology affect each other: the Cartan-Hadamard theorem, Bonnet’s
theorem (and its generalization, Myers’s theorem), and Cartan’s characterization
of manifolds of constant curvature.

The book contains many questions for the reader, which deserve special 
mention. They fall into two categories: “exercises,” which are integrated 
into the text, and “problems,” grouped at the end of each chapter. Both are 
essential to a full understanding of the material, but they are of somewhat 
different character and serve different purposes. 

The exercises include some background material that the student should 
have seen already in an earlier course, some proofs that fill in the gaps from 
the text, some simple but illuminating examples, and some intermediate 
results that are used in the text or the problems. They are, in general, 
elementary, but they are not optional; indeed, they are integral to the
continuity of the text. They are chosen and timed so as to give the reader
opportunities to pause and think over the material that has just been introduced,
to practice working with the definitions, and to develop skills that
are used later in the book. I recommend strongly that students stop and 
do each exercise as it occurs in the text before going any further. 

The problems that conclude the chapters are generally more difficult 
than the exercises, some of them considerably so, and should be considered 
a central part of the book by any student who is serious about learning the 
subject. They not only introduce new material not covered in the body of 
the text, but they also provide the student with indispensable practice in 
using the techniques explained in the text, both for doing computations and 
for proving theorems. If more than a semester is available, the instructor 
might want to present some of these problems in class. 


Acknowledgments: I owe an unpayable debt to the authors of the many
Riemannian geometry books I have used and cherished over the years,
especially the ones mentioned above; I have done little more than rearrange
their ideas into a form that seems handy for teaching. Beyond that,
I would like to thank my Ph.D. advisor, Richard Melrose, who many years
ago introduced me to differential geometry in his eccentric but thoroughly
enlightening way; Judith Arms, who, as a fellow teacher of Riemannian
geometry at the University of Washington, helped brainstorm about the
“ideal contents” of this course; all my graduate students at the University
of Washington who have suffered with amazing grace through the flawed
early drafts of this book, especially Jed Mihalisin, who gave the manuscript
a meticulous reading from a user’s viewpoint and came up with numerous
valuable suggestions; and Ina Lindemann of Springer-Verlag, who encouraged
me to turn my lecture notes into a book and gave me free rein in deciding
on its shape and contents. And of course my wife, Pm Weizenbaum,
who contributed professional editing help as well as the loving support and
encouragement I need to keep at this day after day.

1

What Is Curvature? 


If you’ve just completed an introductory course on differential geometry, 
you might be wondering where the geometry went. In most people’s experience,
geometry is concerned with properties such as distances, lengths,
angles, areas, volumes, and curvature. These concepts, however, are barely 
mentioned in typical beginning graduate courses in differential geometry; 
instead, such courses are concerned with smooth structures, flows, tensors, 
and differential forms. 

The purpose of this book is to introduce the theory of Riemannian 
manifolds: these are smooth manifolds equipped with Riemannian metrics
(smoothly varying choices of inner products on tangent spaces), which
allow one to measure geometric quantities such as distances and angles. 
This is the branch of modern differential geometry in which “geometric” 
ideas, in the familiar sense of the word, come to the fore. It is the direct 
descendant of Euclid’s plane and solid geometry, by way of Gauss’s theory 
of curved surfaces in space, and it is a dynamic subject of contemporary 
research. 

The central unifying theme in current Riemannian geometry research is 
the notion of curvature and its relation to topology. This book is designed 
to help you develop both the tools and the intuition you will need for an in-depth
exploration of curvature in the Riemannian setting. Unfortunately,
as you will soon discover, an adequate development of curvature in an 
arbitrary number of dimensions requires a great deal of technical machinery, 
making it easy to lose sight of the underlying geometric content. To put 
the subject in perspective, therefore, let’s begin by asking some very basic 
questions: What is curvature? What are the important theorems about it? 


In this chapter, we explore these and related questions in an informal way, 
without proofs. In the next chapter, we review some basic material about 
tensors, manifolds, and vector bundles that is used throughout the book. 
The “official” treatment of the subject begins in Chapter 3. 


The Euclidean Plane 

To get a sense of the kinds of questions Riemannian geometers address 
and where these questions came from, let’s look back at the very roots of 
our subject. The treatment of geometry as a mathematical subject began 
with Euclidean plane geometry, which you studied in school. Its elements 
are points, lines, distances, angles, and areas. Here are a couple of typical 
theorems: 

Theorem 1.1. (SSS) Two Euclidean triangles are congruent if and only 
if the lengths of their corresponding sides are equal. 

Theorem 1.2. (Angle-Sum Theorem) The sum of the interior angles
of a Euclidean triangle is $\pi$.

As trivial as they seem, these two theorems serve to illustrate two major 
types of results that permeate the study of geometry; in this book, we call 
them “classification theorems” and “local-global theorems.” 

The SSS (Side-Side-Side) theorem is a classification theorem. Such a 
theorem tells us that to determine whether two mathematical objects are 
equivalent (under some appropriate equivalence relation), we need only 
compare a small (or at least finite!) number of computable invariants. In 
this case the equivalence relation is congruence—equivalence under the 
group of rigid motions of the plane—and the invariants are the three side 
lengths. 

The angle-sum theorem is of a different sort. It relates a local geometric 
property (angle measure) to a global property (that of being a three-sided 
polygon or triangle). Most of the theorems we study in this book are of 
this type, which, for lack of a better name, we call local-global theorems. 

After proving the basic facts about points and lines and the figures constructed
directly from them, one can go on to study other figures derived
from the basic elements, such as circles. Two typical results about circles 
are given below; the first is a classification theorem, while the second is a 
local-global theorem. (It may not be obvious at this point why we consider 
the second to be a local-global theorem, but it will become clearer soon.) 

Theorem 1.3. (Circle Classification Theorem) Two circles in the Euclidean
plane are congruent if and only if they have the same radius.

FIGURE 1.1. Osculating circle.

Theorem 1.4. (Circumference Theorem) The circumference of a Euclidean
circle of radius $R$ is $2\pi R$.

If you want to continue your study of plane geometry beyond figures 
constructed from lines and circles, sooner or later you will have to come to 
terms with other curves in the plane. An arbitrary curve cannot be completely
described by one or two numbers such as length or radius; instead,
the basic invariant is curvature, which is defined using calculus and is a 
function of position on the curve. 

Formally, the curvature of a plane curve $\gamma$ is defined to be $\kappa(t) := |\ddot\gamma(t)|$,
the length of the acceleration vector, when $\gamma$ is given a unit speed parametrization.
(Here and throughout this book, we think of curves as parametrized
by a real variable $t$, with a dot representing a derivative with respect
to $t$.) Geometrically, the curvature has the following interpretation. Given
a point $p = \gamma(t)$, there are many circles tangent to $\gamma$ at $p$, namely, those
circles that have a parametric representation whose velocity vector at $p$ is
the same as that of $\gamma$, or, equivalently, all the circles whose centers lie on
the line orthogonal to $\gamma$ at $p$. Among these parametrized circles, there is
exactly one whose acceleration vector at $p$ is the same as that of $\gamma$; it is
called the osculating circle (Figure 1.1). (If the acceleration of $\gamma$ is zero,
replace the osculating circle by a straight line, thought of as a “circle with
infinite radius.”) The curvature is then $\kappa(t) = 1/R$, where $R$ is the radius of
the osculating circle. The larger the curvature, the greater the acceleration
and the smaller the osculating circle, and therefore the faster the curve is
turning. A circle of radius $R$ obviously has constant curvature $\kappa = 1/R$,
while a straight line has curvature zero.
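
As an informal numerical check of this definition (a minimal sketch, not part of the formal development), the following Python code approximates the length of the acceleration vector by a central second difference, for unit speed circles and for a unit speed straight line. The function names and the step size `h` are illustrative assumptions. The computed values should be close to $1/R$ for the circles and close to zero for the line.

```python
import math

def curvature_unit_speed(gamma, t, h=1e-4):
    """Approximate |gamma''(t)| by a central second difference.

    Assumes gamma is a unit speed (arc length) parametrization returning (x, y).
    """
    x0, y0 = gamma(t - h)
    x1, y1 = gamma(t)
    x2, y2 = gamma(t + h)
    ax = (x0 - 2.0 * x1 + x2) / h**2  # second difference, x-component
    ay = (y0 - 2.0 * y1 + y2) / h**2  # second difference, y-component
    return math.hypot(ax, ay)

def unit_speed_circle(R):
    """Circle of radius R about the origin, traversed at unit speed."""
    return lambda t: (R * math.cos(t / R), R * math.sin(t / R))

def line(t):
    """A straight line through the origin, traversed at unit speed."""
    return (t / math.sqrt(2.0), t / math.sqrt(2.0))

for R in (0.5, 1.0, 4.0):
    kappa = curvature_unit_speed(unit_speed_circle(R), t=0.3)
    print(f"R = {R}: approximate curvature {kappa:.6f}, 1/R = {1.0 / R:.6f}")
print(f"line: approximate curvature {curvature_unit_speed(line, t=0.3):.6f}")
```

The finite-difference approximation is only accurate to a few digits, so the comparison is meant as a plausibility check rather than a proof.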

It is often convenient to extend the definition of the
curvature, allowing it to take on both positive and negative values. This
is done by choosing a unit normal vector field $N$ along the curve, and
assigning the curvature a positive sign if the curve is turning toward the
chosen normal or a negative sign if it is turning away from it. The resulting
function along the curve is then called the signed curvature.

Here are two typical theorems about plane curves: 

Theorem 1.5. (Plane Curve Classification Theorem) Suppose $\gamma$ and
$\tilde\gamma : [a,b] \to \mathbf{R}^2$ are smooth, unit speed plane curves with unit normal vector
fields $N$ and $\tilde N$, and $\kappa_N(t)$, $\kappa_{\tilde N}(t)$ represent the signed curvatures at
$\gamma(t)$ and $\tilde\gamma(t)$, respectively. Then $\gamma$ and $\tilde\gamma$ are congruent (by a direction-preserving
congruence) if and only if $\kappa_N(t) = \kappa_{\tilde N}(t)$ for all $t \in [a,b]$.

Theorem 1.6. (Total Curvature Theorem) If $\gamma : [a,b] \to \mathbf{R}^2$ is a unit
speed simple closed curve such that $\dot\gamma(a) = \dot\gamma(b)$, and $N$ is the inward-pointing
normal, then

$$\int_a^b \kappa_N(t)\,dt = 2\pi.$$


The first of these is a classification theorem, as its name suggests. The 
second is a local-global theorem, since it relates the local property of curvature
to the global (topological) property of being a simple closed curve.
The second will be derived as a consequence of a more general result in 
Chapter 9; the proof of the first is left to Problem 9-6. 
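
The total curvature theorem can be tested numerically on a curve that is not a circle. The sketch below uses an ellipse traversed counterclockwise, so that the inward-pointing normal is the left-hand normal. Because the standard parametrization of an ellipse is not unit speed, the code relies on a standard fact that is assumed here rather than derived in this chapter: for a regular parametrization $(x(t), y(t))$, the signed curvature times the speed equals $(x'y'' - y'x'')/(x'^2 + y'^2)$. The function name and the number of sample points are illustrative choices.

```python
import math

def total_curvature_ellipse(a, b, n=2000):
    """Midpoint-rule approximation of the total signed curvature of the
    ellipse (a cos t, b sin t), 0 <= t <= 2*pi, traversed counterclockwise.

    Uses kappa_N * |gamma'| = (x' y'' - y' x'') / (x'^2 + y'^2), a standard
    formula for regular (not necessarily unit speed) parametrizations.
    """
    dt = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        dx, dy = -a * math.sin(t), b * math.cos(t)      # first derivatives
        ddx, ddy = -a * math.cos(t), -b * math.sin(t)   # second derivatives
        total += (dx * ddy - dy * ddx) / (dx * dx + dy * dy) * dt
    return total

for a, b in ((1.0, 1.0), (3.0, 1.0), (2.0, 5.0)):
    print(f"a = {a}, b = {b}: total curvature / (2 pi) = "
          f"{total_curvature_ellipse(a, b) / (2.0 * math.pi):.9f}")
```

Although the curvature of an ellipse varies from point to point, the ratio printed should be very close to 1 in every case, independent of the semi-axes.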

It is interesting to note that when we specialize to circles, these theorems
reduce to the two theorems about circles above: Theorem 1.5 says that two
circles are congruent if and only if they have the same curvature, while Theorem
1.6 says that if a circle has curvature $\kappa$ and circumference $C$, then
$\kappa C = 2\pi$. It is easy to see that these two results are equivalent to Theorems
1.3 and 1.4. This is why it makes sense to consider the circumference
theorem as a local-global theorem.


Surfaces in Space 

The next step in generalizing Euclidean geometry is to start working 
in three dimensions. After investigating the basic elements of “solid 
geometry”—points, lines, planes, distances, angles, areas, volumes—and 
the objects derived from them, such as polyhedra and spheres, one is led 
to study more general curved surfaces in space (2-dimensional embedded
submanifolds of $\mathbf{R}^3$, in the language of differential geometry). The basic
invariant in this setting is again curvature, but it’s a bit more complicated 
than for plane curves, because a surface can curve differently in different 
directions. 

The curvature of a surface in space is described by two numbers at each 
point, called the principal curvatures. We define them formally in Chapter
8, but here’s an informal recipe for computing them. Suppose $S$ is a surface
in $\mathbf{R}^3$, $p$ is a point in $S$, and $N$ is a unit normal vector to $S$ at $p$.

FIGURE 1.2. Computing principal curvatures.


1. Choose a plane $\Pi$ through $p$ that contains $N$. The intersection of $\Pi$
with $S$ is then a plane curve $\gamma \subset \Pi$ passing through $p$ (Figure 1.2).

2. Compute the signed curvature $\kappa_N$ of $\gamma$ at $p$ with respect to the chosen
unit normal $N$.

3. Repeat this for all normal planes $\Pi$. The principal curvatures of $S$ at
$p$, denoted $\kappa_1$ and $\kappa_2$, are defined to be the minimum and maximum
signed curvatures so obtained.
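
The three-step recipe above can be carried out numerically for a simple sample surface. The sketch below takes the half-cylinder $y^2 + z^2 = 1$, $z > 0$, the point $p = (0, 0, 1)$, and the downward-pointing unit normal $N = (0, 0, -1)$; all of these are illustrative choices. Each normal plane through $p$ is determined by a horizontal direction $(\cos\theta, \sin\theta, 0)$, and the section curve in that plane is the graph $z(s) = \sqrt{1 - s^2\sin^2\theta}$. Since this graph has a horizontal tangent at $s = 0$, its curvature there with respect to the upward normal is $z''(0)$, so the signed curvature with respect to $N$ is $-z''(0)$; the code approximates $z''(0)$ by a second difference.

```python
import math

def section_height(s, theta):
    """z-coordinate of the normal section of y^2 + z^2 = 1 through
    p = (0, 0, 1) in the horizontal direction (cos theta, sin theta, 0)."""
    return math.sqrt(1.0 - (s * math.sin(theta)) ** 2)

def signed_normal_curvature(theta, h=1e-3):
    """Signed curvature at p of the normal section, with respect to the
    downward-pointing unit normal N = (0, 0, -1)."""
    # Central second difference approximating z''(0).
    z2 = (section_height(h, theta) - 2.0 * section_height(0.0, theta)
          + section_height(-h, theta)) / h**2
    # The curve bends toward N exactly when z'' is negative.
    return 0.0 - z2

# Steps 1-3: sweep over normal planes, one per degree of rotation about N.
thetas = [math.pi * k / 180.0 for k in range(180)]
curvatures = [signed_normal_curvature(theta) for theta in thetas]
kappa_1, kappa_2 = min(curvatures), max(curvatures)
print(f"kappa_1 is approximately {kappa_1:.6f}")
print(f"kappa_2 is approximately {kappa_2:.6f}")
```

The minimum occurs for the plane containing the axis direction of the cylinder (a straight-line section) and the maximum for the plane perpendicular to it (a circular section of radius 1).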

Although the principal curvatures give us a lot of information about the 
geometry of S, they do not directly address a question that turns out to 
be of paramount importance in Riemannian geometry: Which properties 
of a surface are intrinsic? Roughly speaking, intrinsic properties are those 
that could in principle be measured or determined by a 2-dimensional being
living entirely within the surface. More precisely, a property of surfaces in
$\mathbf{R}^3$ is called intrinsic if it is preserved by isometries (maps from one surface
to another that preserve lengths of curves). 

To see that the principal curvatures are not intrinsic, consider the following
two embedded surfaces $S_1$ and $S_2$ in $\mathbf{R}^3$ (Figures 1.3 and 1.4). $S_1$
is the portion of the $xy$-plane where $0 < y < \pi$, and $S_2$ is the half-cylinder
$\{(x, y, z) : y^2 + z^2 = 1,\ z > 0\}$. If we follow the recipe above for computing
principal curvatures (using, say, the downward-pointing unit normal), we
find that, since all planes intersect $S_1$ in straight lines, the principal curvatures
of $S_1$ are $\kappa_1 = \kappa_2 = 0$. On the other hand, it is not hard to see
that the principal curvatures of $S_2$ are $\kappa_1 = 0$ and $\kappa_2 = 1$. However, the
map taking $(x, y, 0)$ to $(x, \cos y, \sin y)$ is a diffeomorphism between $S_1$ and
$S_2$ that preserves lengths of curves, and is thus an isometry.

FIGURE 1.3. $S_1$.

FIGURE 1.4. $S_2$.
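
The claim that the map $(x, y, 0) \mapsto (x, \cos y, \sin y)$ preserves lengths of curves can be spot-checked numerically. The following sketch samples one arbitrarily chosen curve in the flat strip (its formula is purely illustrative, chosen only so that $0 < y < \pi$), maps it onto the half-cylinder, and compares the lengths of the two resulting polygonal approximations.

```python
import math

def polyline_length(points):
    """Sum of the Euclidean distances between consecutive sample points."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def roll_up(point):
    """The map (x, y, 0) -> (x, cos y, sin y) from the strip to the half-cylinder."""
    x, y, _ = point
    return (x, math.cos(y), math.sin(y))

def sample_flat_curve(n=20000):
    """Sample an illustrative curve in the strip 0 < y < pi of the xy-plane."""
    points = []
    for i in range(n + 1):
        t = i / n
        points.append((t * t, 1.5 + math.sin(3.0 * t), 0.0))
    return points

flat_curve = sample_flat_curve()
rolled_curve = [roll_up(p) for p in flat_curve]
flat_length = polyline_length(flat_curve)
rolled_length = polyline_length(rolled_curve)
print(f"length in the strip:         {flat_length:.6f}")
print(f"length on the half-cylinder: {rolled_length:.6f}")
```

The two lengths should agree up to the small discretization error of the polygonal approximation. A single curve proves nothing, of course, but it illustrates what the isometry property asserts.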

Even though the principal curvatures are not intrinsic, Gauss made the
surprising discovery in 1827 [Gau65] (see also [Spi79, volume 2] for an
excellent annotated version of Gauss’s paper) that a particular combination
of them is intrinsic. He found a proof that the product $K = \kappa_1\kappa_2$, now called
the Gaussian curvature, is intrinsic. He thought this result was so amazing
that he named it Theorema Egregium, which in colloquial American English
can be translated roughly as “Totally Awesome Theorem.” We prove it in
Chapter 8.

To get a feeling for what Gaussian curvature tells us about surfaces, let’s
look at a few examples. Simplest of all is the plane, which, as we have
seen, has both principal curvatures equal to zero and therefore has constant
Gaussian curvature equal to zero. The half-cylinder described above
also has $K = \kappa_1\kappa_2 = 0 \cdot 1 = 0$. Another simple example is a sphere of
radius $R$. Any normal plane intersects the sphere in great circles, which
have radius $R$ and therefore curvature $\pm 1/R$ (with the sign depending on
whether we choose the outward-pointing or inward-pointing normal). Thus
the principal curvatures are both equal to $\pm 1/R$, and the Gaussian curvature
is $\kappa_1\kappa_2 = 1/R^2$. Note that while the signs of the principal curvatures
depend on the choice of unit normal, the Gaussian curvature does not: it
is always positive on the sphere.

Similarly, any surface that is “bowl-shaped” or “dome-shaped” has positive
Gaussian curvature (Figure 1.5), because the two principal curvatures
always have the same sign, regardless of which normal is chosen. On the
other hand, the Gaussian curvature of any surface that is “saddle-shaped”
is negative (Figure 1.6), because the principal curvatures are of opposite
signs.

FIGURE 1.5. $K > 0$.

FIGURE 1.6. $K < 0$.

The model spaces of surface theory are the surfaces with constant Gaussian
curvature. We have already seen two of them: the Euclidean plane
($K = 0$), and the sphere of radius $R$ ($K = 1/R^2$). The third model
is a surface of constant negative curvature, which is not so easy to visualize
because it cannot be realized globally as an embedded surface in $\mathbf{R}^3$.
Nonetheless, for completeness, let’s just mention that the upper half-plane
$\{(x, y) : y > 0\}$ with the Riemannian metric $g = R^2 y^{-2}(dx^2 + dy^2)$ has constant
negative Gaussian curvature $K = -1/R^2$. In the special case $R = 1$
(so $K = -1$), this is called the hyperbolic plane.
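
The curvature of the upper half-plane metric can be checked numerically, at the cost of borrowing a formula that is not derived in this chapter. The sketch below assumes the standard fact that a metric of the form $e^{2\varphi}(dx^2 + dy^2)$ has Gaussian curvature $K = -e^{-2\varphi}\Delta\varphi$, where $\Delta$ is the ordinary Laplacian in the coordinates $(x, y)$. For the metric above, $\varphi = \log(R/y)$. The Laplacian is approximated by a five-point finite difference; the function names and the step size are illustrative.

```python
import math

def conformal_gaussian_curvature(phi, x, y, h=1e-3):
    """Gaussian curvature of the metric exp(2*phi) * (dx^2 + dy^2) at (x, y),
    using K = -exp(-2*phi) * Laplacian(phi) (assumed standard formula) with a
    five-point finite-difference Laplacian."""
    laplacian = (phi(x + h, y) + phi(x - h, y) + phi(x, y + h) + phi(x, y - h)
                 - 4.0 * phi(x, y)) / h**2
    return -math.exp(-2.0 * phi(x, y)) * laplacian

def half_plane_phi(R):
    """Conformal exponent of g = R^2 y^(-2) (dx^2 + dy^2), valid for y > 0."""
    return lambda x, y: math.log(R / y)

for R in (1.0, 2.0):
    for (x, y) in ((0.0, 0.5), (0.7, 2.0), (-3.0, 5.0)):
        K = conformal_gaussian_curvature(half_plane_phi(R), x, y)
        print(f"R = {R}, point ({x}, {y}): K is approximately {K:.6f}, "
              f"-1/R^2 = {-1.0 / R**2:.6f}")
```

The computed value should not depend on the point chosen, which is what constant curvature means.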

Surface theory is a highly developed branch of geometry. Of all its results, 
two—a classification theorem and a local-global theorem—are universally 
acknowledged as the most important. 

Theorem 1.7. (Uniformization Theorem) Every connected 2-manifold
is diffeomorphic to a quotient of one of the three constant curvature
model surfaces listed above by a discrete group of isometries acting freely 
and properly discontinuously. Therefore, every connected 2-manifold has a 
complete Riemannian metric with constant Gaussian curvature. 


Theorem 1.8. (Gauss-Bonnet Theorem) Let $S$ be an oriented compact
2-manifold with a Riemannian metric. Then

    $$\int_S K \, dA = 2\pi \chi(S),$$

where $\chi(S)$ is the Euler characteristic of $S$ (which is equal to 2 if $S$ is the
sphere, 0 if it is the torus, and $2 - 2g$ if it is an orientable surface of genus
$g$).
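
The statement can be checked numerically on two familiar surfaces of revolution. The sketch below is only an illustration: the curvature and area formulas for the round sphere and for the torus with profile curve $(2 + \cos t, \sin t)$ are standard facts assumed here rather than derived, and the helper name `total_curvature` is hypothetical.

```python
import math

def total_curvature(K, area_density, t_min, t_max, n=2000):
    """Integrate K dA over a surface of revolution.

    K(t) and area_density(t) depend only on the profile parameter t, so the
    integral over the rotation angle contributes a factor of 2*pi.
    The t-integral uses the midpoint rule with n subintervals.
    """
    h = (t_max - t_min) / n
    total = 0.0
    for i in range(n):
        t = t_min + (i + 0.5) * h
        total += K(t) * area_density(t) * h
    return 2.0 * math.pi * total

# Sphere of radius R (assumed formulas): K = 1/R^2, dA = R^2 sin(t) dt dtheta.
R = 3.0
sphere = total_curvature(lambda t: 1.0 / R**2,
                         lambda t: R**2 * math.sin(t),
                         0.0, math.pi)

# Torus with profile (2 + cos t, sin t) (assumed formulas):
# K = cos(t) / (2 + cos t), dA = (2 + cos t) dt dtheta.
torus = total_curvature(lambda t: math.cos(t) / (2.0 + math.cos(t)),
                        lambda t: 2.0 + math.cos(t),
                        0.0, 2.0 * math.pi)

print(sphere / (2.0 * math.pi))  # approximately 2 (sphere)
print(torus / (2.0 * math.pi))   # approximately 0 (torus)
```

The total curvature of the sphere does not depend on $R$, as the theorem predicts: the factor $1/R^2$ in $K$ cancels the factor $R^2$ in the area element.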

The uniformization theorem is a classification theorem, because it replaces
the problem of classifying surfaces with that of classifying discrete
groups of isometries of the models. The latter problem is not easy by any
means, but it sheds a great deal of new light on the topology of surfaces
nonetheless. Although stated here as a geometric-topological result, the
uniformization theorem is usually stated somewhat differently and proved
using complex analysis; we do not give a proof here. If you are familiar with
complex analysis and the complex version of the uniformization theorem, it
will be an enlightening exercise after you have finished this book to prove
that the complex version of the theorem is equivalent to the one stated
here.

The Gauss-Bonnet theorem, on the other hand, is purely a theorem of 
differential geometry, arguably the most fundamental and important one 
of all. We go through a detailed proof in Chapter 9. 

Taken together, these theorems place strong restrictions on the types of 
metrics that can occur on a given surface. For example, one consequence of 
the Gauss-Bonnet theorem is that the only compact, connected, orientable 
surface that admits a metric of strictly positive Gaussian curvature is the 
sphere. On the other hand, if a compact, connected, orientable surface 
has nonpositive Gaussian curvature, the Gauss-Bonnet theorem forces its 
genus to be at least 1, and then the uniformization theorem tells us that 
its universal covering space is topologically equivalent to the plane. 
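
The two genus restrictions just mentioned follow from the Gauss-Bonnet formula in one line each. The short derivation below spells this out for a compact, connected, orientable surface $S$ of genus $g$; it uses only Theorem 1.8 and the value $\chi(S) = 2 - 2g$.

```latex
\begin{align*}
  K > 0 \text{ everywhere}
    &\;\Longrightarrow\; 2\pi(2 - 2g) = \int_S K \, dA > 0
     \;\Longrightarrow\; g < 1
     \;\Longrightarrow\; g = 0, \\
  K \le 0 \text{ everywhere}
    &\;\Longrightarrow\; 2\pi(2 - 2g) = \int_S K \, dA \le 0
     \;\Longrightarrow\; g \ge 1.
\end{align*}
```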


Curvature in Higher Dimensions 

We end our survey of the basic ideas of geometry by mentioning briefly how
curvature appears in higher dimensions. Suppose $M$ is an n-dimensional
manifold equipped with a Riemannian metric $g$. As with surfaces, the basic
geometric invariant is curvature, but curvature becomes a much more
complicated quantity in higher dimensions because a manifold may curve
in so many directions.

The first problem we must contend with is that, in general, Riemannian
manifolds are not presented to us as embedded submanifolds of Euclidean
space. Therefore, we must abandon the idea of cutting out curves by
intersecting our manifold with planes, as we did when defining the principal
curvatures of a surface in $\mathbb{R}^3$. Instead, we need a more intrinsic way
of sweeping out submanifolds. Fortunately, geodesics (curves that are the
shortest paths between nearby points) are ready-made tools for this and
many other purposes in Riemannian geometry. Examples are straight lines
in Euclidean space and great circles on a sphere.

The most fundamental fact about geodesics, which we prove in Chapter
4, is that given any point $p \in M$ and any vector $V$ tangent to $M$ at $p$, there
is a unique geodesic starting at $p$ with initial tangent vector $V$.

Here is a brief recipe for computing some curvatures at a point $p \in M$:

1. Pick a 2-dimensional subspace $\Pi$ of the tangent space to $M$ at $p$.

2. Look at all the geodesics through $p$ whose initial tangent vectors lie in
the selected plane $\Pi$. It turns out that near $p$ these sweep out a certain
2-dimensional submanifold $S_\Pi$ of $M$, which inherits a Riemannian
metric from $M$.

3. Compute the Gaussian curvature of $S_\Pi$ at $p$, which the Theorema
Egregium tells us can be computed from its Riemannian metric. This
gives a number, denoted $K(\Pi)$, called the sectional curvature of $M$
at $p$ associated with the plane $\Pi$.

Thus the “curvature” of $M$ at $p$ has to be interpreted as a map

    $$K : \{\text{2-planes in } T_pM\} \to \mathbb{R}.$$

Again we have three constant (sectional) curvature model spaces: $\mathbb{R}^n$
with its Euclidean metric (for which $K = 0$); the n-sphere of radius $R$,
with the Riemannian metric inherited from $\mathbb{R}^{n+1}$ ($K = 1/R^2$); and
hyperbolic space of radius $R$, which is the upper half-space $\{x \in \mathbb{R}^n : x^n > 0\}$
with the metric $h_R := R^2 (x^n)^{-2} \sum_i (dx^i)^2$ ($K = -1/R^2$). Unfortunately,
however, there is as yet no satisfactory uniformization theorem for Riemannian
manifolds in higher dimensions. In particular, it is definitely not
true that every manifold possesses a metric of constant sectional curvature.
In fact, the constant curvature metrics can all be described rather explicitly
by the following classification theorem.

Theorem 1.9. (Classification of Constant Curvature Metrics) A
complete, connected Riemannian manifold $M$ with constant sectional
curvature is isometric to $\widetilde{M}/\Gamma$, where $\widetilde{M}$ is one of the constant curvature
model spaces $\mathbb{R}^n$, $S^n_R$, or $H^n_R$, and $\Gamma$ is a discrete group of isometries of
$\widetilde{M}$, isomorphic to $\pi_1(M)$, and acting freely and properly discontinuously
on $\widetilde{M}$.

On the other hand, there are a number of powerful local-global theorems, 
which can be thought of as generalizations of the Gauss-Bonnet theorem in 
various directions. They are consequences of the fact that positive curvature 
makes geodesics converge, while negative curvature forces them to spread 
out. Here are two of the most important such theorems: 

Theorem 1.10. (Cartan-Hadamard) Suppose $M$ is a complete, connected
Riemannian n-manifold with all sectional curvatures less than or
equal to zero. Then the universal covering space of $M$ is diffeomorphic to
$\mathbb{R}^n$.


Theorem 1.11. (Bonnet) Suppose $M$ is a complete, connected Riemannian
manifold with all sectional curvatures bounded below by a positive constant.
Then $M$ is compact and has a finite fundamental group.

Looking back at the remarks concluding the section on surfaces above,
you can see that these last three theorems generalize some of the
consequences of the uniformization and Gauss-Bonnet theorems, although not
their full strength. It is the primary goal of this book to prove Theorems
1.9, 1.10, and 1.11; it is a primary goal of current research in Riemannian
geometry to improve upon them and further generalize the results of
surface theory to higher dimensions.


Review of Tensors, Manifolds, and Vector Bundles


Most of the technical machinery of Riemannian geometry is built up using
tensors; indeed, Riemannian metrics themselves are tensors. Thus we
begin by reviewing the basic definitions and properties of tensors on a
finite-dimensional vector space. When we put together spaces of tensors
on a manifold, we obtain a particularly useful type of geometric structure
called a “vector bundle,” which plays an important role in many of our
investigations. Because vector bundles are not always treated in beginning
manifolds courses, we include a fairly complete discussion of them in this
chapter. The chapter ends with an application of these ideas to tensor bundles
on manifolds, which are vector bundles constructed from tensor spaces
associated with the tangent space at each point.

Much of the material included in this chapter should be familiar from
your study of manifolds. It is included here as a review and to establish
our notations and conventions for later use. If you need more detail on any
topics mentioned here, consult [Boo86] or [Spi79, volume 1].


Tensors on a Vector Space 

Let $V$ be a finite-dimensional vector space (all our vector spaces and
manifolds are assumed real). As usual, $V^*$ denotes the dual space of $V$ (the
space of covectors, or real-valued linear functionals, on $V$), and we denote
the natural pairing $V^* \times V \to \mathbb{R}$ by either of the notations

    $$(\omega, X) \mapsto \langle \omega, X \rangle \quad \text{or} \quad (\omega, X) \mapsto \omega(X)$$

for $\omega \in V^*$, $X \in V$.

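A covariant k-tensor on $V$ is a multilinear map

    $$F : \underbrace{V \times \cdots \times V}_{k \text{ copies}} \to \mathbb{R}.$$

Similarly, a contravariant l-tensor is a multilinear map

    $$F : \underbrace{V^* \times \cdots \times V^*}_{l \text{ copies}} \to \mathbb{R}.$$

We often need to consider tensors of mixed types as well. A tensor of type
$\binom{k}{l}$, also called a k-covariant, l-contravariant tensor, is a multilinear map

    $$F : \underbrace{V^* \times \cdots \times V^*}_{l \text{ copies}} \times \underbrace{V \times \cdots \times V}_{k \text{ copies}} \to \mathbb{R}.$$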

Actually, in many cases it is necessary to consider multilinear maps whose
arguments consist of $k$ vectors and $l$ covectors, but not necessarily in the
order implied by the definition above; such an object is still called a tensor
of type $\binom{k}{l}$. For any given tensor, we will make it clear which arguments
are vectors and which are covectors.

The space of all covariant k-tensors on $V$ is denoted by $T^k(V)$, the space
of contravariant l-tensors by $T_l(V)$, and the space of mixed $\binom{k}{l}$-tensors by
$T^k_l(V)$. The rank of a tensor is the number of arguments (vectors and/or
covectors) it takes.

There are obvious identifications $T^k_0(V) = T^k(V)$, $T^0_l(V) = T_l(V)$,
$T^1(V) = V^*$, $T_1(V) = V^{**} = V$, and $T^0(V) = \mathbb{R}$. A less obvious, but
extremely important, identification is $T^1_1(V) = \operatorname{End}(V)$, the space of linear
endomorphisms of $V$ (linear maps from $V$ to itself). A more general version
of this identification is expressed in the following lemma.

Lemma 2.1. Let $V$ be a finite-dimensional vector space. There is a natural
(basis-independent) isomorphism between $T^k_{l+1}(V)$ and the space of
multilinear maps

    $$\underbrace{V^* \times \cdots \times V^*}_{l} \times \underbrace{V \times \cdots \times V}_{k} \to V.$$

Exercise 2.1. Prove Lemma 2.1. [Hint: In the special case $k = 1$, $l = 0$,
consider the map $\Phi : \operatorname{End}(V) \to T^1_1(V)$ by letting $\Phi A$ be the $\binom{1}{1}$-tensor
defined by $\Phi A(\omega, X) = \omega(AX)$. The general case is similar.]

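There is a natural product, called the tensor product, linking the various
tensor spaces over $V$; if $F \in T^k_l(V)$ and $G \in T^p_q(V)$, the tensor $F \otimes G \in
T^{k+p}_{l+q}(V)$ is defined by

    $$F \otimes G(\omega^1, \dots, \omega^{l+q}, X_1, \dots, X_{k+p})
      = F(\omega^1, \dots, \omega^l, X_1, \dots, X_k) \, G(\omega^{l+1}, \dots, \omega^{l+q}, X_{k+1}, \dots, X_{k+p}).$$

If $(E_1, \dots, E_n)$ is a basis for $V$, we let $(\varphi^1, \dots, \varphi^n)$ denote the
corresponding dual basis for $V^*$, defined by $\varphi^i(E_j) = \delta^i_j$. A basis for $T^k_l(V)$ is
given by the set of all tensors of the form

    $$E_{j_1} \otimes \cdots \otimes E_{j_l} \otimes \varphi^{i_1} \otimes \cdots \otimes \varphi^{i_k}, \tag{2.1}$$

as the indices $i_p$, $j_q$ range from 1 to $n$. These tensors act on basis elements
by

    $$E_{j_1} \otimes \cdots \otimes E_{j_l} \otimes \varphi^{i_1} \otimes \cdots \otimes \varphi^{i_k}
      (\varphi^{s_1}, \dots, \varphi^{s_l}, E_{r_1}, \dots, E_{r_k})
      = \delta^{s_1}_{j_1} \cdots \delta^{s_l}_{j_l} \, \delta^{i_1}_{r_1} \cdots \delta^{i_k}_{r_k}.$$

Any tensor $F \in T^k_l(V)$ can be written in terms of this basis as

    $$F = F^{j_1 \dots j_l}_{i_1 \dots i_k} \, E_{j_1} \otimes \cdots \otimes E_{j_l} \otimes \varphi^{i_1} \otimes \cdots \otimes \varphi^{i_k}, \tag{2.2}$$

where

    $$F^{j_1 \dots j_l}_{i_1 \dots i_k} = F(\varphi^{j_1}, \dots, \varphi^{j_l}, E_{i_1}, \dots, E_{i_k}).$$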

In (2.2), and throughout this book, we use the Einstein summation convention
for expressions with indices: if in any term the same index name
appears twice, as both an upper and a lower index, that term is assumed to
be summed over all possible values of that index (usually from 1 to the
dimension of the space). We always choose our index positions so that vectors
have lower indices and covectors have upper indices, while the components
of vectors have upper indices and those of covectors have lower indices.
This ensures that summations that make mathematical sense always obey
the rule that each repeated index appears once up and once down in each
term to be summed.
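
To make the summation convention concrete, the following sketch evaluates a $\binom{1}{1}$-tensor on a covector and a vector by summing over both repeated indices explicitly. The function name and the storage convention `F[j][i]` (upper index first) are assumptions made only for this illustration.

```python
def evaluate_mixed_tensor(F, omega, X):
    """Return F(omega, X) = F^j_i omega_j X^i, summing over both repeated indices.

    F[j][i] holds the component with upper index j and lower index i;
    omega[j] are covector components (lower index) and X[i] are vector
    components (upper index).
    """
    n = len(X)
    return sum(F[j][i] * omega[j] * X[i] for j in range(n) for i in range(n))

F = [[1, 2],
     [3, 4]]
omega = [1, -1]
X = [2, 5]

value = evaluate_mixed_tensor(F, omega, X)

# Same number computed as omega(A X), treating F as the matrix of an endomorphism A.
AX = [sum(F[j][i] * X[i] for i in range(2)) for j in range(2)]
value_via_endomorphism = sum(omega[j] * AX[j] for j in range(2))

print(value, value_via_endomorphism)  # -14 -14
```

The second computation is the one suggested by the hint to Exercise 2.1: the same array of components can be read either as a tensor or as an endomorphism.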

If the arguments of a mixed tensor $F$ occur in a nonstandard order, then
the horizontal as well as vertical positions of the indices are significant and
reflect which arguments are vectors and which are covectors. For example,
if $B$ is a $\binom{2}{1}$-tensor whose first argument is a vector, second is a covector,
and third is a vector, its components are written

    $$B_i{}^j{}_k = B(E_i, \varphi^j, E_k). \tag{2.3}$$

We can use the result of Lemma 2.1 to define a natural operation called
trace or contraction, which lowers the rank of a tensor by 2. In one special
case, it is easy to describe: the operator $\operatorname{tr} : T^1_1(V) \to \mathbb{R}$ is just the trace
of $F$ when it is considered as an endomorphism of $V$. Since the trace of
an endomorphism is basis-independent, this is well defined. More generally,
we define $\operatorname{tr} : T^{k+1}_{l+1}(V) \to T^k_l(V)$ by letting $\operatorname{tr} F(\omega^1, \dots, \omega^l, V_1, \dots, V_k)$ be
the trace of the endomorphism

    $$F(\omega^1, \dots, \omega^l, \cdot, V_1, \dots, V_k, \cdot) \in T^1_1(V).$$

In terms of a basis, the components of $\operatorname{tr} F$ are

    $$(\operatorname{tr} F)^{j_1 \dots j_l}_{i_1 \dots i_k} = F^{j_1 \dots j_l m}_{i_1 \dots i_k m}.$$

Even more generally, we can contract a given tensor on any pair of indices
as long as one is contravariant and one is covariant. There is no general
notation for this operation, so we just describe it in words each time it
arises. For example, we can contract the tensor $B$ with components given
by (2.3) on its first and second indices to obtain a covariant 1-tensor $A$
whose components are $A_k = B_i{}^i{}_k$.

Exercise 2.2. Show that the trace on any pair of indices is a well-defined
linear map from $T^{k+1}_{l+1}(V)$ to $T^k_l(V)$.
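
The contraction $A_k = B_i{}^i{}_k$ described above can be sketched with nested lists. The test tensor $B = \alpha \otimes v \otimes \beta$ is a hypothetical choice, convenient because its contraction is visibly $\alpha(v)\,\beta$; the function name is likewise an assumption.

```python
def contract_first_two(B):
    """Contract B_i^j_k on its first and second indices: A_k = B_i^i_k.

    B[i][j][k] stores the component with index order (i, j, k).
    """
    n = len(B)
    return [sum(B[i][i][k] for i in range(n)) for k in range(n)]

# Hypothetical test tensor with components B_i^j_k = alpha_i v^j beta_k.
alpha = [1, 2, 3]
v = [4, 0, -1]
beta = [5, 6, 7]
B = [[[alpha[i] * v[j] * beta[k] for k in range(3)]
      for j in range(3)]
     for i in range(3)]

A = contract_first_two(B)
pairing = sum(alpha[i] * v[i] for i in range(3))  # alpha(v) = 4 + 0 - 3 = 1

print(A)  # [5, 6, 7], which is alpha(v) * beta
```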

A class of tensors that plays a special role in differential geometry is that
of alternating tensors: those that change sign whenever two arguments
are interchanged. We let $\Lambda^k(V)$ denote the space of covariant alternating
k-tensors on $V$, also called k-covectors or (exterior) k-forms. There is a
natural bilinear, associative product on forms called the wedge product,
defined on 1-forms by setting

    $$\omega^1 \wedge \cdots \wedge \omega^k (X_1, \dots, X_k) = \det(\langle \omega^i, X_j \rangle),$$

and extending by linearity. (There is an alternative definition of the wedge
product in common use, which amounts to multiplying our wedge product
by a factor of $1/k!$. The choice of which definition to use is a matter
of convention, though there are various reasons to justify each choice
depending on the context. The definition we have chosen is most common
in introductory differential geometry texts, and is used, for example, in
[Boo86, Cha93, dC92, Spi79]. The other convention is used in [KN63] and
is more common in complex differential geometry.)
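
For two 1-forms, the determinant convention reduces to $\omega \wedge \eta(X, Y) = \omega(X)\eta(Y) - \omega(Y)\eta(X)$. The sketch below implements exactly this case with covectors and vectors stored as lists of components; the helper names `pair` and `wedge2` are hypothetical.

```python
def pair(omega, X):
    """Natural pairing <omega, X> = omega_i X^i."""
    return sum(w * x for w, x in zip(omega, X))

def wedge2(omega, eta):
    """Return omega ^ eta as a function of two vectors, using the determinant
    convention: the determinant of the 2x2 matrix of pairings."""
    def form(X, Y):
        return pair(omega, X) * pair(eta, Y) - pair(omega, Y) * pair(eta, X)
    return form

omega = [1, 0, 2]
eta = [0, 3, 1]
X = [1, 1, 0]
Y = [2, 0, 1]

two_form = wedge2(omega, eta)
print(two_form(X, Y), two_form(Y, X), two_form(X, X))  # -11 11 0
```

Interchanging the arguments changes the sign, and repeating an argument gives zero, as expected of an alternating tensor.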


Manifolds 

Now we turn our attention to manifolds. Throughout this book, all our
manifolds are assumed to be smooth, Hausdorff, and second countable;
and smooth always means $C^\infty$, or infinitely differentiable. As in most parts
of differential geometry, the theory still works under weaker differentiability
assumptions, but such considerations are usually relevant only when
treating questions of hard analysis that are beyond our scope.

We write local coordinates on any open subset $U \subset M$ as $(x^1, \dots, x^n)$,
$(x^i)$, or $x$, depending on context. Although, formally speaking, coordinates
constitute a map from $U$ to $\mathbb{R}^n$, it is more common to use a coordinate
chart to identify $U$ with its image in $\mathbb{R}^n$, and to identify a point in $U$ with
its coordinate representation $(x^i)$ in $\mathbb{R}^n$.

For any $p \in M$, the tangent space $T_pM$ can be characterized either as the
set of derivations of the algebra of germs at $p$ of $C^\infty$ functions on $M$ (i.e.,
tangent vectors are “directional derivatives”), or as the set of equivalence
classes of curves through $p$ under a suitable equivalence relation (i.e.,
tangent vectors are “velocities”). Regardless of which characterization is taken
as the definition, local coordinates $(x^i)$ give a basis for $T_pM$ consisting of
the partial derivative operators $\partial/\partial x^i$. When there can be no confusion
about which coordinates are meant, we usually abbreviate $\partial/\partial x^i$ by the
notation $\partial_i$.

On a finite-dimensional vector space $V$ with its standard smooth manifold
structure, there is a natural (basis-independent) identification of each
tangent space $T_pV$ with $V$ itself, obtained by identifying a vector $X \in V$
with the directional derivative

    $$Xf = \left. \frac{d}{dt} \right|_{t=0} f(p + tX).$$

In terms of the coordinates $(x^i)$ induced on $V$ by any basis, this is just the
usual identification $(x^1, \dots, x^n) \leftrightarrow x^i \partial_i$.

In this book, we always write coordinates with upper indices, as in $(x^i)$.
This has the consequence that the differentials $dx^i$ of the coordinate functions
are consistent with the convention that covectors have upper indices.
Likewise, the coordinate vectors $\partial_i = \partial/\partial x^i$ have lower indices if we
consider an upper index “in the denominator” to be the same as a lower index.

If $\widetilde{M}$ is a smooth manifold, a submanifold (or immersed submanifold) of
$\widetilde{M}$ is a smooth manifold $M$ together with an injective immersion $\iota : M \to
\widetilde{M}$. Identifying $M$ with its image $\iota(M) \subset \widetilde{M}$, we can consider $M$ as a subset
of $\widetilde{M}$, although in general the topology and smooth structure of $M$ may
have little to do with those of $\widetilde{M}$ and have to be considered as extra data.
The most important type of submanifold is that in which the inclusion
map $\iota$ is an embedding, which means that it is a homeomorphism onto its
image with the subspace topology. In that case, $M$ is called an embedded
submanifold or a regular submanifold.

Suppose $M$ is an embedded n-dimensional submanifold of an m-dimensional
manifold $\widetilde{M}$. For every point $p \in M$, there exist slice coordinates
$(x^1, \dots, x^m)$ on a neighborhood $\widetilde{U}$ of $p$ in $\widetilde{M}$ such that $\widetilde{U} \cap M$ is
given by $\{x : x^{n+1} = \cdots = x^m = 0\}$, and $(x^1, \dots, x^n)$ form local coordinates
for $M$ (Figure 2.1). At each $q \in \widetilde{U} \cap M$, $T_qM$ can be naturally
identified as the subspace of $T_q\widetilde{M}$ spanned by the vectors $(\partial_1, \dots, \partial_n)$.


Exercise 2.3. Suppose $M \subset \widetilde{M}$ is an embedded submanifold.

(a) If $f$ is any smooth function on $M$, show that $f$ can be extended to a
smooth function on $\widetilde{M}$ whose restriction to $M$ is $f$. [Hint: Extend $f$ locally
in slice coordinates by letting it be independent of $(x^{n+1}, \dots, x^m)$,
and patch together using a partition of unity.]

(b) Show that any vector field on $M$ can be extended to a vector field on
$\widetilde{M}$.

(c) If $\widetilde{X}$ is a vector field on $\widetilde{M}$, show that $\widetilde{X}$ is tangent to $M$ at points
of $M$ if and only if $\widetilde{X}f = 0$ whenever $f \in C^\infty(\widetilde{M})$ is a function that
vanishes on $M$.

FIGURE 2.1. Slice coordinates.


Vector Bundles 

When we glue together the tangent spaces at all points on a manifold M, 
we get a set that can be thought of both as a union of vector spaces and 
as a manifold in its own right. This kind of structure is so common in 
differential geometry that it has a name. 

A (smooth) k-dimensional vector bundle is a pair of smooth manifolds $E$
(the total space) and $M$ (the base), together with a surjective map $\pi : E \to
M$ (the projection), satisfying the following conditions:

(a) Each set $E_p := \pi^{-1}(p)$ (called the fiber of $E$ over $p$) is endowed with
the structure of a vector space.

(b) For each $p \in M$, there exists a neighborhood $U$ of $p$ and a diffeomorphism
$\varphi : \pi^{-1}(U) \to U \times \mathbb{R}^k$ (Figure 2.2), called a local trivialization
of $E$, such that the following diagram commutes:

    $$\begin{array}{ccc}
    \pi^{-1}(U) & \xrightarrow{\ \varphi\ } & U \times \mathbb{R}^k \\
    \pi \downarrow & & \downarrow \pi_1 \\
    U & = & U
    \end{array}$$

(where $\pi_1$ is the projection onto the first factor).

(c) The restriction of $\varphi$ to each fiber, $\varphi : E_p \to \{p\} \times \mathbb{R}^k$, is a linear
isomorphism.

FIGURE 2.2. A local trivialization.

Whether or not you have encountered the formal definition of vector
bundles, you have certainly seen at least two examples: the tangent bundle
$TM$ of a smooth manifold $M$, which is just the disjoint union of the tangent
spaces $T_pM$ for all $p \in M$, and the cotangent bundle $T^*M$, which is the
disjoint union of the cotangent spaces $T_p^*M = (T_pM)^*$. Another example
that is relatively easy to visualize (and which we formally define in Chapter
8) is the normal bundle to a submanifold $M \subset \mathbb{R}^n$, whose fiber at each
point is the normal space $N_pM$, the orthogonal complement of $T_pM$ in $\mathbb{R}^n$.

It frequently happens that we are given a collection of vector spaces, one
for each point in a manifold, that we would like to “glue together” to form a
vector bundle. For example, this is how the tangent and cotangent bundles
are defined. There is a shortcut for showing that such a collection forms
a vector bundle without first constructing a smooth manifold structure on
the total space. As the next lemma shows, all we need to do is to exhibit
the maps that we wish to consider as local trivializations and check that
they overlap correctly.

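Lemma 2.2. Let $M$ be a smooth manifold, $E$ a set, and $\pi : E \to M$ a
surjective map. Suppose we are given an open covering $\{U_\alpha\}$ of $M$ together
with bijective maps $\varphi_\alpha : \pi^{-1}(U_\alpha) \to U_\alpha \times \mathbb{R}^k$ satisfying $\pi_1 \circ \varphi_\alpha = \pi$, such
that whenever $U_\alpha \cap U_\beta \neq \emptyset$, the composite map

    $$\varphi_\alpha \circ \varphi_\beta^{-1} : U_\alpha \cap U_\beta \times \mathbb{R}^k \to U_\alpha \cap U_\beta \times \mathbb{R}^k$$

is of the form

    $$\varphi_\alpha \circ \varphi_\beta^{-1}(p, V) = (p, \tau(p)V) \tag{2.4}$$

for some smooth map $\tau : U_\alpha \cap U_\beta \to GL(k, \mathbb{R})$. Then $E$ has a unique
structure as a smooth k-dimensional vector bundle over $M$ for which the
maps $\varphi_\alpha$ are local trivializations.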

Proof. For each $p \in M$, let $E_p = \pi^{-1}(p)$. If $p \in U_\alpha$, observe that the
map $(\varphi_\alpha)_p : E_p \to \{p\} \times \mathbb{R}^k$ obtained by restricting $\varphi_\alpha$ is a bijection. We
can define a vector space structure on $E_p$ by declaring this map to be
a linear isomorphism. This structure is well defined, since for any other
set $U_\beta$ containing $p$, (2.4) guarantees that $(\varphi_\alpha)_p \circ (\varphi_\beta)_p^{-1} = \tau(p)$ is an
isomorphism.

Shrinking the sets $U_\alpha$ and taking more of them if necessary, we may
assume each of them is diffeomorphic to some open set $\widetilde{U}_\alpha \subset \mathbb{R}^n$. Following
$\varphi_\alpha$ with such a diffeomorphism, we get a bijection $\pi^{-1}(U_\alpha) \to \widetilde{U}_\alpha \times \mathbb{R}^k$,
which we can use as a coordinate chart for $E$. Because (2.4) shows that the
$\varphi_\alpha$s overlap smoothly, these charts determine a locally Euclidean topology
and a smooth manifold structure on $E$. It is immediate that each map $\varphi_\alpha$
is a diffeomorphism with respect to this smooth structure, and the rest of
the conditions for a vector bundle follow automatically. □

The smooth $GL(k, \mathbb{R})$-valued maps $\tau$ of the preceding lemma are called
transition functions for $E$.

As an illustration, we show how to apply this construction to the tangent
bundle. Given a coordinate chart $(U, (x^i))$ for $M$, any tangent vector
$V \in T_xM$ at a point $x \in U$ can be expressed in terms of the coordinate
basis as $V = v^i \partial/\partial x^i$ for some n-tuple $v = (v^1, \dots, v^n)$. Define a bijection
$\varphi : \pi^{-1}(U) \to U \times \mathbb{R}^n$ by sending $V \in T_xM$ to $(x, v)$. Where two coordinate
charts $(x^i)$ and $(\tilde{x}^i)$ overlap, the respective coordinate basis vectors
are related by

    $$\frac{\partial}{\partial x^i} = \frac{\partial \tilde{x}^j}{\partial x^i} \frac{\partial}{\partial \tilde{x}^j},$$

and therefore the same vector $V$ is represented by

    $$V = \tilde{v}^j \frac{\partial}{\partial \tilde{x}^j} = v^i \frac{\partial}{\partial x^i} = v^i \frac{\partial \tilde{x}^j}{\partial x^i} \frac{\partial}{\partial \tilde{x}^j}.$$

This means that $\tilde{v}^j = v^i \, \partial \tilde{x}^j / \partial x^i$, so the corresponding local trivializations
$\varphi$ and $\tilde{\varphi}$ are related by

    $$\tilde{\varphi} \circ \varphi^{-1}(x, v) = \tilde{\varphi}(V) = (x, \tilde{v}) = (x, \tau(x)v),$$

where $\tau(x)$ is the $GL(n, \mathbb{R})$-valued function $\partial \tilde{x}^j / \partial x^i$. It is now immediate
from Lemma 2.2 that these are the local trivializations for a vector bundle
structure on $TM$.

It is useful to note that this construction actually gives explicit coordinates
$(x^i, v^i)$ on $\pi^{-1}(U)$, which we will refer to as standard coordinates for
the tangent bundle.
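
As a concrete instance of a transition function for the tangent bundle, take polar coordinates $(r, \theta)$ and Cartesian coordinates $(x, y)$ on the plane minus the origin. The sketch below, whose function names are hypothetical, builds the Jacobian matrix of the coordinate change and uses it to convert the components of a tangent vector.

```python
import math

def polar_to_cartesian_transition(r, theta):
    """Jacobian matrix d(x, y)/d(r, theta) for x = r cos(theta), y = r sin(theta).

    Rows are indexed by the Cartesian coordinate, columns by the polar one.
    """
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta),  r * math.cos(theta)]]

def transform_components(tau, v):
    """Apply the rule v~^j = tau^j_i v^i (matrix times column vector)."""
    return [sum(tau[j][i] * v[i] for i in range(len(v))) for j in range(len(tau))]

# The coordinate vector d/d(theta) at the point r = 2, theta = pi/2.
tau = polar_to_cartesian_transition(2.0, math.pi / 2)
v_cartesian = transform_components(tau, [0.0, 1.0])
print(v_cartesian)  # close to [-2.0, 0.0]
```

The determinant of this matrix is $r$, which is nonzero away from the origin, so the matrix does lie in $GL(2, \mathbb{R})$ wherever polar coordinates are defined.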

If $\pi : E \to M$ is a vector bundle over $M$, a section of $E$ is a map $F : M \to
E$ such that $\pi \circ F = \mathrm{Id}_M$, or, equivalently, $F(p) \in E_p$ for all $p$. It is said to
be a smooth section if it is smooth as a map between manifolds. The next
lemma gives another criterion for smoothness that is more easily verified
in practice.

Lemma 2.3. Let $F : M \to E$ be a section of a vector bundle. $F$ is smooth
if and only if the components $F^{j_1 \dots j_l}_{i_1 \dots i_k}$ of $F$ in terms of any smooth local
frame $\{E_i\}$ on an open set $U \subset M$ depend smoothly on $p \in U$.

Exercise 2.4. Prove Lemma 2.3. 

The set of smooth sections of a vector bundle is an infinite-dimensional
vector space under pointwise addition and multiplication by constants,
whose zero element is the zero section $\zeta$ defined by $\zeta_p = 0 \in E_p$ for all
$p \in M$. In this book, we use the script letter corresponding to the name
of a vector bundle to denote its space of sections. Thus, for example, the
space of smooth sections of $TM$ is denoted $\mathcal{T}(M)$; it is the space of smooth
vector fields on $M$. (Many books use the notation $\mathfrak{X}(M)$ for this space, but
our notation is more systematic, and seems to be becoming more common.)


Tensor Bundles and Tensor Fields 

On a manifold $M$, we can perform the same linear-algebraic constructions
on each tangent space $T_pM$ that we perform on any vector space, yielding
tensors at $p$. For example, a $\binom{k}{l}$-tensor at $p \in M$ is just an element of
$T^k_l(T_pM)$. We define the bundle of $\binom{k}{l}$-tensors on $M$ as

    $$T^k_l M := \coprod_{p \in M} T^k_l(T_pM),$$

where $\coprod$ denotes the disjoint union. Similarly, the bundle of k-forms is

    $$\Lambda^k M := \coprod_{p \in M} \Lambda^k(T_pM).$$

There are the usual identifications $T_1M = TM$ and $T^1M = \Lambda^1M = T^*M$.

To see that each of these tensor bundles is a vector bundle, define the
projection $\pi : T^k_l M \to M$ to be the map that simply sends $F \in T^k_l(T_pM)$
to $p$. If $(x^i)$ are any local coordinates on $U \subset M$, and $p \in U$, the coordinate
vectors $\{\partial_i\}$ form a basis for $T_pM$ whose dual basis is $\{dx^i\}$. Any tensor
$F \in T^k_l(T_pM)$ can be expressed in terms of this basis as

    $$F = F^{j_1 \dots j_l}_{i_1 \dots i_k} \, \partial_{j_1} \otimes \cdots \otimes \partial_{j_l} \otimes dx^{i_1} \otimes \cdots \otimes dx^{i_k}.$$

Exercise 2.5. For any coordinate chart $(U, (x^i))$ on $M$, define a map $\varphi$
from $\pi^{-1}(U) \subset T^k_l M$ to $U \times \mathbb{R}^{n^{k+l}}$ by sending a tensor $F \in T^k_l(T_xM)$ to
$(x, (F^{j_1 \dots j_l}_{i_1 \dots i_k})) \in U \times \mathbb{R}^{n^{k+l}}$. Show that $T^k_l M$ can be made into a smooth
vector bundle in a unique way so that all such maps are local trivializations.

A tensor field on $M$ is a smooth section of some tensor bundle $T^k_l M$,
and a differential k-form is a smooth section of $\Lambda^k M$. To avoid confusion
between the point $p \in M$ at which a tensor field is evaluated and the
vectors and covectors to which it is applied, we usually write the value of a
tensor field $F$ at $p \in M$ as $F_p \in T^k_l(T_pM)$, or, if it is clearer (for example if
$F$ itself has one or more subscripts), as $F|_p$. The space of $\binom{k}{l}$-tensor fields
is denoted by $\mathcal{T}^k_l(M)$, and the space of covariant k-tensor fields (smooth
sections of $T^kM$) by $\mathcal{T}^k(M)$. In particular, $\mathcal{T}^1(M)$ is the space of 1-forms.
We follow the common practice of denoting the space of smooth real-valued
functions on $M$ (i.e., smooth sections of $T^0M$) by $C^\infty(M)$.

Let $(E_1, \dots, E_n)$ be any local frame for $TM$, that is, $n$ smooth vector
fields defined on some open set $U$ such that $(E_1|_p, \dots, E_n|_p)$ form a basis
for $T_pM$ at each point $p \in U$. Associated with such a frame is the dual
coframe, which we denote $(\varphi^1, \dots, \varphi^n)$; these are smooth 1-forms satisfying
$\varphi^i(E_j) = \delta^i_j$. In terms of any local frame, a $\binom{k}{l}$-tensor field $F$ can be written
in the form (2.2), where now the components $F^{j_1 \dots j_l}_{i_1 \dots i_k}$ are to be interpreted
as functions on $U$. In particular, in terms of a coordinate frame $\{\partial_i\}$ and
its dual coframe $\{dx^i\}$, $F$ has the coordinate expression

    $$F_p = F^{j_1 \dots j_l}_{i_1 \dots i_k}(p) \, \partial_{j_1} \otimes \cdots \otimes \partial_{j_l} \otimes dx^{i_1} \otimes \cdots \otimes dx^{i_k}. \tag{2.5}$$

Exercise 2.6. Let $F : M \to T^k_l M$ be a section. Show that $F$ is a smooth
tensor field if and only if whenever $\{X_i\}$ are smooth vector fields and
$\{\omega^j\}$ are smooth 1-forms defined on an open set $U \subset M$, the function
$F(\omega^1, \dots, \omega^l, X_1, \dots, X_k)$ on $U$, defined by

    $$F(\omega^1, \dots, \omega^l, X_1, \dots, X_k)(p) = F_p(\omega^1_p, \dots, \omega^l_p, X_1|_p, \dots, X_k|_p),$$

is smooth.

An important property of tensor fields is that they are multilinear over
the space of smooth functions. Given a tensor field $F \in \mathcal{T}^k_l(M)$, vector
fields $X_i \in \mathcal{T}(M)$, and 1-forms $\omega^j \in \mathcal{T}^1(M)$, Exercise 2.6 shows that the
function $F(\omega^1, \dots, \omega^l, X_1, \dots, X_k)$ is smooth, and thus $F$ induces a map

    $$F : \mathcal{T}^1(M) \times \cdots \times \mathcal{T}^1(M) \times \mathcal{T}(M) \times \cdots \times \mathcal{T}(M) \to C^\infty(M).$$

It is easy to check that this map is multilinear over $C^\infty(M)$, that is, for
any functions $f, g \in C^\infty(M)$ and any smooth vector or covector fields $\alpha$,
$\beta$,

    $$F(\dots, f\alpha + g\beta, \dots) = f F(\dots, \alpha, \dots) + g F(\dots, \beta, \dots).$$

Even more important is the converse: as the next lemma shows, any such
map that is multilinear over $C^\infty(M)$ defines a tensor field.

Lemma 2.4. (Tensor Characterization Lemma) A map

    $$\tau : \mathcal{T}^1(M) \times \cdots \times \mathcal{T}^1(M) \times \mathcal{T}(M) \times \cdots \times \mathcal{T}(M) \to C^\infty(M)$$

is induced by a $\binom{k}{l}$-tensor field as above if and only if it is multilinear over
$C^\infty(M)$. Similarly, a map

    $$\tau : \mathcal{T}^1(M) \times \cdots \times \mathcal{T}^1(M) \times \mathcal{T}(M) \times \cdots \times \mathcal{T}(M) \to \mathcal{T}(M)$$

is induced by a $\binom{k}{l+1}$-tensor field as in Lemma 2.1 if and only if it is
multilinear over $C^\infty(M)$.

Exercise 2.7. Prove Lemma 2.4.


Definitions and Examples of Riemannian Metrics


In this chapter we officially define Riemannian metrics and construct some
of the elementary objects associated with them. At the end of the chapter,
we introduce three classes of highly symmetric “model” Riemannian
manifolds (Euclidean spaces, spheres, and hyperbolic spaces) to which we
will return repeatedly as our understanding deepens and our tools become
more sophisticated.


Riemannian Metrics 

Definitions 

A Riemannian metric on a smooth manifold $M$ is a 2-tensor field $g \in
\mathcal{T}^2(M)$ that is symmetric (i.e., $g(X, Y) = g(Y, X)$) and positive definite
(i.e., $g(X, X) > 0$ if $X \neq 0$). A Riemannian metric thus determines an inner
product on each tangent space $T_pM$, which is typically written $\langle X, Y \rangle :=
g(X, Y)$ for $X, Y \in T_pM$. A manifold together with a given Riemannian
metric is called a Riemannian manifold. We often use the word “metric”
to refer to a Riemannian metric when there is no chance of confusion.

Exercise 3.1. Using a partition of unity, prove that every manifold can 
be given a Riemannian metric. 

Just as in Euclidean geometry, if $p$ is a point in a Riemannian manifold
$(M, g)$, we define the length or norm of any tangent vector $X \in T_pM$ to be
$|X| := \langle X, X \rangle^{1/2}$. Unless we specify otherwise, we define the angle between
two nonzero vectors $X, Y \in T_pM$ to be the unique $\theta \in [0, \pi]$ satisfying
$\cos \theta = \langle X, Y \rangle / (|X| \, |Y|)$. (Later, we will further refine the notion of angle
in special cases to allow more general values of $\theta$.) We say that $X$ and $Y$
are orthogonal if their angle is $\pi/2$, or equivalently if $\langle X, Y \rangle = 0$. Vectors
$E_1, \dots, E_k$ are called orthonormal if they are of length 1 and pairwise
orthogonal, or equivalently if $\langle E_i, E_j \rangle = \delta_{ij}$.
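
The definitions of norm and angle can be sketched in code once the inner product on a single tangent space is represented by a symmetric positive definite matrix of coefficients in some basis. That representation, the function names, and the sample matrix below (a multiple $y^{-2}$ of the identity with $y = 2$) are assumptions made only for this illustration.

```python
import math

def inner(g, X, Y):
    """<X, Y> = g_ij X^i Y^j for an inner product with coefficient matrix g."""
    n = len(X)
    return sum(g[i][j] * X[i] * Y[j] for i in range(n) for j in range(n))

def norm(g, X):
    """|X| = <X, X>^(1/2)."""
    return math.sqrt(inner(g, X, X))

def angle(g, X, Y):
    """The unique theta in [0, pi] with cos(theta) = <X, Y> / (|X| |Y|)."""
    c = inner(g, X, Y) / (norm(g, X) * norm(g, Y))
    return math.acos(max(-1.0, min(1.0, c)))  # clamp against rounding error

# Hypothetical coefficient matrix: y^{-2} times the identity, with y = 2.
y = 2.0
g = [[1.0 / y**2, 0.0],
     [0.0, 1.0 / y**2]]

print(norm(g, [1.0, 0.0]))               # 0.5
print(angle(g, [1.0, 0.0], [1.0, 1.0]))  # close to pi/4
```

Rescaling an inner product by a positive constant changes lengths but not angles, which is why the second value agrees with the Euclidean angle.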

If $(M, g)$ and $(\widetilde{M}, \tilde{g})$ are Riemannian manifolds, a diffeomorphism $\varphi$ from
$M$ to $\widetilde{M}$ is called an isometry if $\varphi^* \tilde{g} = g$. We say $(M, g)$ and $(\widetilde{M}, \tilde{g})$ are
isometric if there exists an isometry between them. It is easy to verify
that being isometric is an equivalence relation on the class of Riemannian
manifolds. Riemannian geometry is concerned primarily with properties
that are preserved by isometries.

An isometry $\varphi : (M, g) \to (M, g)$ is called an isometry of $M$. A composition
of isometries and the inverse of an isometry are again isometries, so
the set of isometries of $M$ is a group, called the isometry group of $M$; it is
denoted $\mathcal{I}(M)$. (It can be shown that the isometry group is always a
finite-dimensional Lie group acting smoothly on $M$; see, for example, [Kob72,
Theorem 11.1.2].)

If $(E_1, \dots, E_n)$ is any local frame for $TM$, and $(\varphi^1, \dots, \varphi^n)$ is its dual
coframe, a Riemannian metric can be written locally as

    $$g = g_{ij} \, \varphi^i \otimes \varphi^j.$$

The coefficient matrix, defined by $g_{ij} = \langle E_i, E_j \rangle$, is symmetric in $i$ and $j$
and depends smoothly on $p \in M$. In particular, in a coordinate frame, $g$
has the form

    $$g = g_{ij} \, dx^i \otimes dx^j. \tag{3.1}$$

The notation can be shortened by introducing the symmetric product of
two 1-forms $\omega$ and $\eta$, denoted by juxtaposition with no product symbol:

    $$\omega\eta := \tfrac{1}{2}(\omega \otimes \eta + \eta \otimes \omega).$$

Because of the symmetry of $g_{ij}$, (3.1) is equivalent to

    $$g = g_{ij} \, dx^i dx^j.$$
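
The equivalence of the two expressions for $g$ is a one-line index computation. It is written out below as a sketch, using only the definition of the symmetric product, a relabeling of the summation indices in the second term, and the symmetry $g_{ij} = g_{ji}$.

```latex
\begin{align*}
  g_{ij}\, dx^i dx^j
    &= \tfrac{1}{2}\, g_{ij} \left( dx^i \otimes dx^j + dx^j \otimes dx^i \right) \\
    &= \tfrac{1}{2} \left( g_{ij} + g_{ji} \right) dx^i \otimes dx^j
       && \text{(interchange the names } i, j \text{ in the second term)} \\
    &= g_{ij}\, dx^i \otimes dx^j
       && \text{(since } g_{ji} = g_{ij}\text{)}.
\end{align*}
```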

Exercise 3.2. Let $p$ be any point in a Riemannian n-manifold $(M, g)$.
Show that there is a local orthonormal frame near $p$, that is, a local frame
$E_1, \dots, E_n$ defined in a neighborhood of $p$ that forms an orthonormal basis
for the tangent space at each point. [Hint: Use the Gram-Schmidt algorithm.
Warning: A common mistake made by novices is to assume that one can find
coordinates near $p$ such that the coordinate vector fields $\partial_i$ are orthonormal.
Your solution to this exercise does not show this. In fact, as we will see in
Chapter 7, this is possible only when the metric is flat, i.e., locally isometric
to the Euclidean metric.]

Examples

One obvious example of a Riemannian manifold is $\mathbb{R}^n$ with its Euclidean
metric $\bar{g}$, which is just the usual inner product on each tangent space $T_x\mathbb{R}^n$
under the natural identification $T_x\mathbb{R}^n = \mathbb{R}^n$. In standard coordinates, this
can be written in several ways:

    $$\bar{g} = \sum_i dx^i dx^i = \sum_i (dx^i)^2 = \delta_{ij} \, dx^i dx^j. \tag{3.2}$$

The matrix of $\bar{g}$ in these coordinates is thus $\bar{g}_{ij} = \delta_{ij}$.

Many other examples of Riemannian metrics arise naturally as submanifolds,
products, and quotients of Riemannian manifolds. We begin with
submanifolds. Suppose $(\widetilde{M}, \tilde{g})$ is a Riemannian manifold, and $\iota : M \to \widetilde{M}$
is an (immersed) submanifold of $\widetilde{M}$. The induced metric on $M$ is the
2-tensor $g = \iota^* \tilde{g}$, which is just the restriction of $\tilde{g}$ to vectors tangent to $M$.
Because the restriction of an inner product is itself an inner product, this
obviously defines a Riemannian metric on $M$. For example, the standard
metric on the sphere $S^n \subset \mathbb{R}^{n+1}$ is obtained in this way; we study it in
much more detail later in this chapter.

Computations on a submanifold are usually most conveniently carried
out in terms of a local parametrization: this is an embedding of an open
subset $U \subset \mathbb{R}^n$ into $\tilde M$, whose image is an open subset of $M$.
For example, if $X\colon U \to \mathbb{R}^m$ is a parametrization of a submanifold
$M \subset \mathbb{R}^m$ with the induced metric, the induced metric in standard
coordinates $(u^1, \dots, u^n)$ on $U$ is just

$$g = \sum_{i=1}^{m} (dX^i)^2 = \sum_{i=1}^{m} \left( \frac{\partial X^i}{\partial u^j}\,du^j \right)^2.$$
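
Expanding the square in this formula gives the coefficients $g_{jk} = \sum_i (\partial X^i/\partial u^j)(\partial X^i/\partial u^k)$. The following is a minimal numerical sketch of that computation, assuming central differences are an acceptable stand-in for the partial derivatives; the function names `induced_metric` and `sphere`, and the choice of the spherical-coordinate parametrization of the unit sphere in $\mathbb{R}^3$, are illustrative assumptions rather than anything taken from the text.

```python
import math

def induced_metric(X, u, h=1e-6):
    """Approximate g_jk = sum_i (dX^i/du^j)(dX^i/du^k) at u by central differences."""
    n = len(u)
    cols = []
    for j in range(n):
        up, um = list(u), list(u)
        up[j] += h
        um[j] -= h
        # j-th column of the Jacobian of X
        cols.append([(a - b) / (2 * h) for a, b in zip(X(up), X(um))])
    return [[sum(a * b for a, b in zip(cols[j], cols[k])) for k in range(n)]
            for j in range(n)]

def sphere(u):
    """Spherical-coordinate parametrization (phi, theta) of the unit sphere in R^3."""
    phi, theta = u
    return [math.sin(phi) * math.cos(theta),
            math.sin(phi) * math.sin(theta),
            math.cos(phi)]

# For this parametrization the coefficients should be close to diag(1, sin^2(phi)).
g = induced_metric(sphere, [1.0, 0.5])
```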


Exercise 3.3. Let $\gamma(t) = (a(t), b(t))$, $t \in I$ (an open interval), be a smooth
injective curve in the $xz$-plane, and suppose $a(t) > 0$ and $\dot\gamma(t) \ne 0$ for all
$t \in I$. Let $M \subset \mathbb{R}^3$ be the surface of revolution obtained by revolving the
image of $\gamma$ about the $z$-axis (Figure 3.1).

(a) Show that $M$ is an immersed submanifold of $\mathbb{R}^3$, and is embedded if
$\gamma$ is an embedding.

(b) Show that the map $\varphi(\theta, t) = (a(t)\cos\theta,\ a(t)\sin\theta,\ b(t))$ from
$\mathbb{R} \times I$ to $\mathbb{R}^3$ is a local parametrization of $M$ in a neighborhood of any point.

(c) Compute the expression for the induced metric on $M$ in $(\theta, t)$ coordinates.

(d) Specialize this computation to the case of the doughnut-shaped torus
of revolution given by $(a(t), b(t)) = (2 + \cos t, \sin t)$.

Exercise 3.4. The $n$-torus is the manifold $T^n = S^1 \times \cdots \times S^1$, considered
as the subset of $\mathbb{R}^{2n}$ defined by
$(x^1)^2 + (x^2)^2 = \cdots = (x^{2n-1})^2 + (x^{2n})^2 = 1$.
Show that $X(u^1, \dots, u^n) = (\cos u^1, \sin u^1, \dots, \cos u^n, \sin u^n)$ gives local
parametrizations of $T^n$ when restricted to suitable domains, and that the
induced metric is equal to the Euclidean metric in $(u^i)$ coordinates.

Next we consider products. If $(M_1, g_1)$ and $(M_2, g_2)$ are Riemannian manifolds,
the product $M_1 \times M_2$ has a natural Riemannian metric $g = g_1 \oplus g_2$,
called the product metric, defined by

$$g(X_1 + X_2, Y_1 + Y_2) = g_1(X_1, Y_1) + g_2(X_2, Y_2), \tag{3.3}$$

where $X_i, Y_i \in T_{p_i}M_i$ under the natural identification
$T_{(p_1, p_2)}(M_1 \times M_2) = T_{p_1}M_1 \oplus T_{p_2}M_2$.

Any local coordinates $(x^1, \dots, x^n)$ for $M_1$ and $(x^{n+1}, \dots, x^{n+m})$ for $M_2$
give coordinates $(x^1, \dots, x^{n+m})$ for $M_1 \times M_2$. In terms of these coordinates,
the product metric has the local expression $g = g_{ij}\,dx^i\,dx^j$, where $(g_{ij})$ is
the block diagonal matrix

$$(g_{ij}) = \begin{pmatrix} (g_1)_{ij} & 0 \\ 0 & (g_2)_{ij} \end{pmatrix}.$$

Exercise 3.5. Show that the induced metric on $T^n$ described in Exercise
3.4 is the product metric obtained from the usual induced metric on $S^1 \subset \mathbb{R}^2$.

Our last class of examples is obtained from covering spaces. Suppose
$\pi\colon \tilde M \to M$ is a smooth covering map. A covering transformation (or deck
transformation) is a smooth map $\varphi\colon \tilde M \to \tilde M$ such that
$\pi \circ \varphi = \pi$. If $g$ is a Riemannian metric on $M$, then $\tilde g := \pi^* g$ is a
Riemannian metric on $\tilde M$ that is invariant under all covering transformations.
In this case $\tilde g$ is called the covering metric, and $\pi$ is called a Riemannian covering.

The following exercise shows the converse: Any metric on $\tilde M$ that is
invariant under all covering transformations descends to $M$.

Exercise 3.6. If $\pi\colon \tilde M \to M$ is a smooth covering map, and $\tilde g$ is any
metric on $\tilde M$ that is invariant under all covering transformations, show that
there is a unique metric $g$ on $M$ such that $\tilde g = \pi^* g$.

Exercise 3.7. Let $T^n \subset \mathbb{R}^{2n}$ denote the $n$-torus. Show that the map
$X\colon \mathbb{R}^n \to T^n$ of Exercise 3.4 is a Riemannian covering.

Later in this chapter, we will undertake a much more detailed study of 
three important classes of examples of Riemannian metrics, the “model 
spaces” of Riemannian geometry. Other examples, such as metrics on Lie 
groups and on complex projective spaces, are introduced in the problems 
at the end of the chapter. 


Elementary Constructions Associated with 
Riemannian Metrics 

Raising and Lowering Indices 

One elementary but important property of Riemannian metrics is that they
allow us to convert vectors to covectors and vice versa. Given a metric $g$
on $M$, define a map called flat from $TM$ to $T^*M$ by sending a vector $X$
to the covector $X^\flat$ defined by

$$X^\flat(Y) := g(X, Y).$$

In coordinates,

$$X^\flat = g(X^i\partial_i, \cdot) = g_{ij}X^i\,dx^j.$$

It is standard practice to write $X^\flat$ in coordinates as $X^\flat = X_j\,dx^j$, where

$$X_j := g_{ij}X^i.$$

One says that $X^\flat$ is obtained from $X$ by lowering an index. (This is why
the operation is designated by the musical notation $\flat$ = "flat.")

The matrix of flat in terms of a coordinate basis is therefore the matrix
of $g$ itself. Since the matrix of $g$ is invertible, so is the flat operator; we
denote its inverse by (what else?) $\omega \mapsto \omega^\sharp$, called sharp. In coordinates,
$\omega^\sharp$ has components

$$\omega^i := g^{ij}\omega_j,$$

where, by definition, $g^{ij}$ are the components of the inverse matrix $(g_{ij})^{-1}$.
One says $\omega^\sharp$ is obtained from $\omega$ by raising an index.

Probably the most important application of the sharp operator is to
extend the classical gradient operator to Riemannian manifolds. If $f$ is a
smooth, real-valued function on a Riemannian manifold $(M, g)$, the gradient
of $f$ is the vector field $\operatorname{grad} f := df^\sharp$ obtained from $df$ by raising an index.
Looking through the definitions, we see that $\operatorname{grad} f$ is characterized by the
fact that

$$df(Y) = \langle \operatorname{grad} f, Y \rangle \quad \text{for all } Y \in TM,$$

and has the coordinate expression

$$\operatorname{grad} f = g^{ij}\,\partial_i f\,\partial_j.$$
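
The coordinate formulas for lowering and raising indices, and for the gradient, can be sketched numerically. The example below is only an illustration under stated assumptions: it works at a single point, uses the Euclidean metric of $\mathbb{R}^2$ written in polar coordinates $(r, \theta)$, where $g = dr^2 + r^2\,d\theta^2$, and the helper names `flat`, `sharp`, and `inverse_2x2` are hypothetical (the inverse is hard-coded for the 2-dimensional case).

```python
def flat(g, X):
    """Lower an index: X_j = g_ij X^i."""
    n = len(X)
    return [sum(g[i][j] * X[i] for i in range(n)) for j in range(n)]

def inverse_2x2(g):
    """Inverse of a 2x2 matrix, giving the components g^ij."""
    (a, b), (c, d) = g
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def sharp(g, omega):
    """Raise an index: omega^i = g^ij omega_j (2-dimensional case only)."""
    ginv = inverse_2x2(g)
    n = len(omega)
    return [sum(ginv[i][j] * omega[j] for j in range(n)) for i in range(n)]

r = 2.0
g = [[1.0, 0.0], [0.0, r * r]]   # polar-coordinate metric at radius r
df = [0.0, 1.0]                  # components of df for f(r, theta) = theta
grad_f = sharp(g, df)            # components g^ij d_i f, here (0, 1/r^2)
```

Lowering the index of `grad_f` with `flat` recovers `df`, reflecting that flat and sharp are inverse to each other.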

The flat and sharp operators can be applied to tensors of any rank, in
any index position, to convert tensors from covariant to contravariant or
vice versa. For example, if $B$ is again the 3-tensor with components given
by (2.3), we can lower its middle index to obtain a covariant 3-tensor $B^\flat$
with components

$$B_{ijk} := g_{jl}\,B_i{}^l{}_k.$$

In coordinate-free notation, this is just

$$B^\flat(X, Y, Z) := B(X, Y^\flat, Z).$$

(Of course, if a tensor has more than one upper index, the flat notation
doesn't tell us which one to lower. In such cases, we have to explain in
words what is meant.)

Another important application of the flat and sharp operators is to extend
the trace operator introduced in Chapter 2 to covariant tensors. We
consider only symmetric 2-tensors here, but it is easy to extend these results
to more general tensors.

If $h$ is a symmetric 2-tensor on a Riemannian manifold, then $h^\sharp$ is a
$\binom{1}{1}$-tensor and therefore $\operatorname{tr} h^\sharp$ is defined. We define the trace of
$h$ with respect to $g$ as

$$\operatorname{tr}_g h := \operatorname{tr} h^\sharp.$$

(Because $h$ is symmetric, it doesn't matter which index is raised.) In terms
of a basis, this is

$$\operatorname{tr}_g h = h_i{}^i = g^{ij}h_{ij}.$$

In particular, in an orthonormal basis this is the ordinary trace of a matrix.


Inner Products of Tensors 

A metric is by definition an inner product on tangent vectors. As the following
lemma shows, it determines an inner product (and hence a norm)
on all tensor bundles as well. First a bit of terminology: If $E \to M$ is a
vector bundle, a fiber metric on $E$ is an inner product on each fiber $E_p$
that varies smoothly, in the sense that for any (local) smooth sections $\sigma, \tau$
of $E$, the inner product $\langle \sigma, \tau \rangle$ is a smooth function.

Lemma 3.1. Let $g$ be a Riemannian metric on a manifold $M$. There is
a unique fiber metric on each tensor bundle $T^k_l M$ with the property that
if $(E_1, \dots, E_n)$ is an orthonormal basis for $T_pM$ and $(\varphi^1, \dots, \varphi^n)$ is the
corresponding dual basis, then the collection of tensors given by (2.1) forms
an orthonormal basis for $T^k_l(T_pM)$.

Exercise 3.8. Prove Lemma 3.1 by showing that in any local coordinate
system, the required inner product is given by

$$\langle F, G \rangle = g^{i_1 r_1} \cdots g^{i_k r_k}\, g_{j_1 s_1} \cdots g_{j_l s_l}\, F^{j_1 \dots j_l}_{i_1 \dots i_k}\, G^{s_1 \dots s_l}_{r_1 \dots r_k}.$$

Show moreover that if $\omega, \eta$ are covariant 1-tensors, then

$$\langle \omega, \eta \rangle = \langle \omega^\sharp, \eta^\sharp \rangle.$$


The Volume Element and Integration 

The final general construction we will study before looking at specific ex¬ 
amples of metrics is the volume element. 

Lemma 3.2. On any oriented Riemannian $n$-manifold $(M, g)$, there is a
unique $n$-form $dV$ satisfying the property that $dV(E_1, \dots, E_n) = 1$ whenever
$(E_1, \dots, E_n)$ is an oriented orthonormal basis for some tangent space
$T_pM$.

This $n$-form $dV$ (sometimes denoted $dV_g$ for clarity) is called the (Riemannian)
volume element.

Exercise 3.9. Prove Lemma 3.2, and show that the expression for $dV$
with respect to any oriented local frame $\{E_i\}$ is

$$dV = \sqrt{\det(g_{ij})}\;\varphi^1 \wedge \cdots \wedge \varphi^n,$$

where $g_{ij} = \langle E_i, E_j \rangle$ are the coefficients of $g$ and $\{\varphi^i\}$ is the dual coframe.

The significance of the Riemannian volume element is that it allows us
to integrate functions, not just differential forms. If $f$ is a smooth, compactly
supported function on an oriented Riemannian $n$-manifold $(M, g)$,
then $f\,dV$ is a compactly supported $n$-form. Therefore the integral $\int_M f\,dV$
makes sense, and we define it to be the integral of $f$ over $M$. Similarly, the
volume of $M$ is defined to be $\int_M dV = \int_M 1\,dV$.
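
As a sanity check on the formula $dV = \sqrt{\det(g_{ij})}\,\varphi^1 \wedge \cdots \wedge \varphi^n$, one can approximate the volume (area) of the 2-sphere of radius $R$. This is a rough sketch under stated assumptions: it uses spherical coordinates $(\phi, \theta)$, in which the induced metric is $R^2(d\phi^2 + \sin^2\phi\, d\theta^2)$, it relies on the fact that this chart misses only a set of measure zero, and it uses a simple midpoint rule. The function name `sphere_area` is hypothetical.

```python
import math

def sphere_area(R, steps=2000):
    """Midpoint-rule approximation of the integral of sqrt(det g_ij) d(phi) d(theta)."""
    dphi = math.pi / steps
    total = 0.0
    for k in range(steps):
        phi = (k + 0.5) * dphi
        # g = R^2 diag(1, sin^2 phi), so det g = R^4 sin^2(phi)
        sqrt_det_g = math.sqrt((R * R) * (R * R * math.sin(phi) ** 2))
        total += sqrt_det_g * dphi
    # The integrand does not depend on theta, which ranges over (0, 2*pi).
    return 2 * math.pi * total
```

The result should be close to the classical value $4\pi R^2$.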


Generalizations of Riemannian Metrics 

There are other common ways of measuring "lengths" of tangent vectors on
smooth manifolds. Let's digress briefly to mention three that play important
roles in other branches of mathematics: pseudo-Riemannian metrics,
sub-Riemannian metrics, and Finsler metrics. Each is defined by relaxing
one of the requirements in the definition of Riemannian metric: a
pseudo-Riemannian metric is obtained by relaxing the requirement that the metric
be positive; a sub-Riemannian metric by relaxing the requirement that it
be defined on the whole tangent space; and a Finsler metric by relaxing
the requirement that it be quadratic on each tangent space.

Pseudo-Riemannian Metrics

A pseudo-Riemannian metric (occasionally also called a semi-Riemannian
metric) on a smooth manifold $M$ is a symmetric 2-tensor field $g$ that
is nondegenerate at each point $p \in M$. This means that the only vector
orthogonal to everything is the zero vector. More formally, $g(X, Y) = 0$
for all $Y \in T_pM$ if and only if $X = 0$. If $g = g_{ij}\,\varphi^i\varphi^j$ in terms of a local
coframe, nondegeneracy just means that the matrix $g_{ij}$ is invertible. If $g$ is
Riemannian, nondegeneracy follows immediately from positive-definiteness,
so every Riemannian metric is also a pseudo-Riemannian metric; but in
general pseudo-Riemannian metrics need not be positive.

Given a pseudo-Riemannian metric $g$ and a point $p \in M$, by a simple
extension of the Gram-Schmidt algorithm one can construct a basis
$(E_1, \dots, E_n)$ for $T_pM$ in which $g$ has the expression

$$g = -(\varphi^1)^2 - \cdots - (\varphi^r)^2 + (\varphi^{r+1})^2 + \cdots + (\varphi^n)^2 \tag{3.4}$$

for some integer $0 \le r \le n$. This integer $r$, called the index of $g$, is equal
to the maximum dimension of any subspace of $T_pM$ on which $g$ is negative
definite. Therefore the index is independent of the choice of basis, a fact
known classically as Sylvester's law of inertia.

By far the most important pseudo-Riemannian metrics (other than the
Riemannian ones) are the Lorentz metrics, which are pseudo-Riemannian
metrics of index 1. The most important example of a Lorentz metric is the
Minkowski metric; this is the Lorentz metric $m$ on $\mathbb{R}^{n+1}$ that is written in
terms of coordinates $(\xi^1, \dots, \xi^n, \tau)$ as

$$m = (d\xi^1)^2 + \cdots + (d\xi^n)^2 - (d\tau)^2. \tag{3.5}$$

In the special case of $\mathbb{R}^4$, the Minkowski metric is the fundamental invariant
of Einstein's special theory of relativity, which can be expressed succinctly
by saying that in the absence of gravity, the laws of physics have the same
form in any coordinate system in which the Minkowski metric has the
expression (3.5). The differing physical characteristics of "space" (the $\xi$
directions) and "time" (the $\tau$ direction) arise from the fact that they are
subspaces on which $g$ is positive definite and negative definite, respectively.
The general theory of relativity includes gravitational effects by allowing
the Lorentz metric to vary from point to point.
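
To make the role of the Minkowski metric concrete, the following sketch checks numerically that a hyperbolic rotation in the $(\xi^1, \tau)$-plane (a Lorentz boost, in physics terminology) preserves $m$. The sign convention follows (3.5); the function names `minkowski` and `boost`, and the sample vectors, are illustrative assumptions.

```python
import math

def minkowski(x, y):
    """m(x, y) for vectors (xi^1, ..., xi^n, tau), with the sign convention of (3.5)."""
    return sum(a * b for a, b in zip(x[:-1], y[:-1])) - x[-1] * y[-1]

def boost(x, s):
    """Hyperbolic rotation by parameter s in the (xi^1, tau)-plane."""
    c, sh = math.cosh(s), math.sinh(s)
    y = list(x)
    y[0] = c * x[0] + sh * x[-1]
    y[-1] = sh * x[0] + c * x[-1]
    return y

x = [1.0, 2.0, 3.0, 4.0]
y = [0.5, -1.0, 2.0, 1.0]
before = minkowski(x, y)
after = minkowski(boost(x, 0.7), boost(y, 0.7))  # agrees with `before` up to rounding
```

The identity $\cosh^2 s - \sinh^2 s = 1$ is what makes the two values agree; an ordinary rotation mixing $\xi^1$ and $\tau$ would not preserve $m$.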

Many aspects of the theory of Riemannian metrics apply equally well to 
pseudo-Riemannian metrics. Although we do not treat pseudo-Riemannian 
geometry directly in this book, we will attempt to point out as we go along 
which aspects of the theory apply to pseudo-Riemannian metrics. As a 
rule of thumb, proofs that depend only on the invertibility of the metric 
tensor, such as existence and uniqueness of the Riemannian connection and 
geodesics, work fine in the pseudo-Riemannian setting, while proofs that use 
positivity in an essential way, such as those involving distance-minimizing 
properties of geodesics, do not. 

For an introduction to the mathematical aspects of pseudo-Riemannian 
metrics, see the excellent book [O'N83]; a more physical treatment can be
found in [HE73]. 


Sub-Riemannian Metrics 

A sub-Riemannian metric (also sometimes known as a singular Riemannian
metric or Carnot-Carathéodory metric) on a manifold $M$ is a fiber metric
on a smooth distribution $S \subset TM$ (i.e., a $k$-plane field or sub-bundle of
$TM$). Since lengths make sense only for vectors in $S$, the only curves whose
lengths can be measured are those whose tangent vectors lie everywhere
in $S$. Therefore one usually imposes some condition on $S$ that guarantees
that any two nearby points can be connected by such a curve. This is, in
a sense, the opposite of the Frobenius integrability condition, which would
restrict every such curve to lie in a single leaf of a foliation.

Sub-Riemannian metrics arise naturally in the study of the abstract models
of real submanifolds of complex space $\mathbb{C}^n$, called CR manifolds. (Here
CR stands for "Cauchy-Riemann.") CR manifolds are real manifolds endowed
with a distribution $S \subset TM$ whose fibers carry the structure of complex
vector spaces (with an additional integrability condition that need not
concern us here). In the model case of a submanifold $M \subset \mathbb{C}^n$, $S$ is the set of
vectors tangent to $M$ that remain tangent after multiplication by $i = \sqrt{-1}$
in the ambient complex coordinates. If $S$ is sufficiently far from being
integrable, choosing a fiber metric on $S$ results in a sub-Riemannian metric
whose geometric properties closely reflect the complex-analytic properties
of $M$ as a subset of $\mathbb{C}^n$.

Another motivation for studying sub-Riemannian metrics arises from
control theory. In this subject, one is given a manifold with a vector field
depending on parameters called controls, with the goal being to vary the
controls so as to obtain a solution curve with desired properties, often
one that minimizes some function such as arc length. If the vector field is
everywhere tangent to a distribution $S$ on the manifold (for example, in
the case of a robot arm whose motion is restricted by the orientations of
its hinges), then the function can often be modeled as a sub-Riemannian
metric and optimal solutions modeled as sub-Riemannian geodesics.

A useful introduction to the geometry of sub-Riemannian metrics is pro¬ 
vided in the article [Str86]. 


Finsler Metrics 

A Finsler metric on a manifold $M$ is a continuous function $F\colon TM \to \mathbb{R}$,
smooth on the complement of the zero section, that defines a norm on
each tangent space $T_pM$. This means that $F(X) > 0$ for $X \ne 0$,
$F(cX) = |c|\,F(X)$ for $c \in \mathbb{R}$, and $F(X + Y) \le F(X) + F(Y)$. Again, the norm
function associated with any Riemannian metric is a special case.

The inventor of Riemannian geometry himself, G. F. B. Riemann, clearly
envisaged an important role in $n$-dimensional geometry for what we now
call Finsler metrics; he restricted his investigations to the "Riemannian"
case purely for simplicity (see [Spi79, volume 2]). However, only very recently
have Finsler metrics begun to be studied seriously from a geometric
point of view; see [Che96] for a survey of recent progress in the
differential-geometric investigation of Finsler metrics.

The recent upsurge of interest in Finsler metrics has been motivated
largely by the fact that two different Finsler metrics appear very naturally
in the theory of several complex variables: at least for bounded strictly
convex domains in $\mathbb{C}^n$, the Kobayashi metric and the Carathéodory metric
are intrinsically defined, biholomorphically invariant Finsler metrics.
Combining differential-geometric and complex-analytic methods has led to
striking new insights into both the function theory and the geometry of
such domains. We do not treat Finsler metrics further in this book, but
you can consult one of the recent books on the subject (e.g. [AP94, JP93])
or the references cited in [Che96].


The Model Spaces of Riemannian Geometry

Before we delve into the general theory of Riemannian manifolds, let’s 
give it some substance by introducing three classes of highly symmetric 
“model spaces” of Riemannian geometry—Euclidean space, spheres, and 
hyperbolic spaces. For much more information on the material covered in 
this section, see [Wol84]. 


Euclidean Space 

The simplest and most important model Riemannian manifold is of course
$\mathbb{R}^n$ itself, with the Euclidean metric $\bar g$ given by (3.2). More generally, if $V$ is
any $n$-dimensional vector space endowed with an inner product, we can set
$g(X, Y) = \langle X, Y \rangle$ for any $X, Y \in T_pV = V$. Choosing an orthonormal basis
$(E_1, \dots, E_n)$ for $V$ defines a map from $\mathbb{R}^n$ to $V$ by sending $(x^1, \dots, x^n)$ to
$x^i E_i$; this is easily seen to be an isometry of $(V, g)$ with $(\mathbb{R}^n, \bar g)$.


Spheres 

Our second model space is the sphere of radius $R$ in $\mathbb{R}^{n+1}$, denoted $S^n_R$,
with the metric $\mathring g_R$ induced from the Euclidean metric on $\mathbb{R}^{n+1}$, which we
call the round metric of radius $R$. (When $R = 1$, this is simply called the
round metric, and we'll use the notations $S^n$ and $\mathring g$.)

One of the first things one notices about the spheres is that they are
highly symmetric. To describe the symmetries of the sphere, we introduce
some standard terminology. Let $M$ be a Riemannian manifold. First, $M$
is a homogeneous Riemannian manifold if it admits a Lie group acting
smoothly and transitively by isometries. Second, given a point $p \in M$, $M$
is isotropic at $p$ if there exists a Lie group $G$ acting smoothly on $M$ by
isometries such that the isotropy subgroup $G_p \subset G$ (the subgroup of elements
of $G$ that fix $p$) acts transitively on the set of unit vectors in $T_pM$
(where $g \in G_p$ acts on $T_pM$ by $g_*\colon T_pM \to T_pM$). Clearly a homogeneous
Riemannian manifold that is isotropic at one point is isotropic at every
point; in that case, one says $M$ is homogeneous and isotropic. A homogeneous
Riemannian manifold looks geometrically the same at every point,
while an isotropic one looks the same in every direction.

We can immediately write down a large group of isometries of $S^n_R$ by
observing that the linear action of the orthogonal group $O(n+1)$ on $\mathbb{R}^{n+1}$
preserves $S^n_R$ and the Euclidean metric, so its restriction to $S^n_R$ acts by
isometries of the sphere. (Later we'll see in fact that this is the full isometry
group, but we don't need that fact now.)

Proposition 3.3. $O(n+1)$ acts transitively on orthonormal bases on $S^n_R$.
More precisely, given any two points $p, \tilde p \in S^n_R$, and orthonormal bases $\{E_i\}$
for $T_pS^n_R$ and $\{\tilde E_i\}$ for $T_{\tilde p}S^n_R$, there exists $\varphi \in O(n+1)$ such that
$\varphi(p) = \tilde p$ and $\varphi_* E_i = \tilde E_i$. In particular, $S^n_R$ is homogeneous and isotropic.

FIGURE 3.2. Transitivity of $O(n+1)$ on orthonormal bases.

Proof. It suffices to show that given any $p \in S^n_R$ and any orthonormal
basis $\{E_i\}$ for $T_pS^n_R$, there is an orthogonal map that takes the "north
pole" $N = (0, \dots, 0, R)$ to $p$ and the standard basis $\{\partial_i\}$ for $T_NS^n_R$ to $\{E_i\}$.

To do so, think of $p$ as a vector of length $R$ in $\mathbb{R}^{n+1}$, and let $\hat p = p/R$
denote the corresponding unit vector (Figure 3.2). Since the basis vectors
$\{E_i\}$ are tangent to the sphere, they are orthogonal to $\hat p$, so $(E_1, \dots, E_n, \hat p)$
is an orthonormal basis for $\mathbb{R}^{n+1}$. Let $\alpha$ be the matrix whose columns
are these basis vectors. Then $\alpha \in O(n+1)$, and by elementary linear
algebra $\alpha$ takes the standard basis vectors $(\partial_1, \dots, \partial_{n+1})$ to $(E_1, \dots, E_n, \hat p)$.
In particular, $\alpha(0, \dots, 0, R) = p$. Moreover, since $\alpha$ acts linearly on $\mathbb{R}^{n+1}$,
its push-forward is represented in standard coordinates by the same matrix,
so $\alpha_*\partial_i = E_i$ for $i = 1, \dots, n$, and $\alpha$ is the desired orthogonal map. □




Another important feature of the sphere, one that is much less evident
than its symmetry, is that it is locally conformally equivalent to Euclidean
space, in a sense that we now describe. Two metrics $g_1$ and $g_2$ on a manifold
$M$ are said to be conformal to each other if there is a positive function
$f \in C^\infty(M)$ such that $g_2 = f g_1$. Two Riemannian manifolds $(M, g)$ and
$(\tilde M, \tilde g)$ are said to be conformally equivalent if there is a diffeomorphism
$\varphi\colon M \to \tilde M$ such that $\varphi^*\tilde g$ is conformal to $g$.


Exercise 3.10. (a) Show that two metrics are conformal if and only if
they define the same angles but not necessarily the same lengths.

(b) Show that a diffeomorphism is a conformal equivalence if and only if
it preserves angles.


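A conformal equivalence between $\mathbb{R}^n$ and the sphere $S^n_R \subset \mathbb{R}^{n+1}$ minus
a point is provided by stereographic projection from the north pole. This is
the map $\sigma\colon S^n_R - \{N\} \to \mathbb{R}^n$ that sends a point $P \in S^n_R - \{N\} \subset \mathbb{R}^{n+1}$,
written $P = (\xi^1, \dots, \xi^n, \tau)$, to $u \in \mathbb{R}^n$, where $U = (u^1, \dots, u^n, 0)$ is the
point where the line through $N$ and $P$ intersects the hyperplane $\{\tau = 0\}$ in
$\mathbb{R}^{n+1}$ (Figure 3.3). Thus $U$ is characterized by the fact that
$\overrightarrow{NU} = \lambda\,\overrightarrow{NP}$ for some nonzero scalar $\lambda$. Writing $N = (0, R)$,
$U = (u, 0)$, and $P = (\xi, \tau) \in \mathbb{R}^{n+1} = \mathbb{R}^n \times \mathbb{R}$, this leads to the
system of equations

$$\begin{aligned} u &= \lambda\xi, \\ -R &= \lambda(\tau - R). \end{aligned} \tag{3.6}$$

Solving the second equation for $\lambda$ and plugging it into the first equation,
we get the formula for stereographic projection

$$\sigma(\xi, \tau) = u = \frac{R\,\xi}{R - \tau}. \tag{3.7}$$

Clearly $\sigma$ is defined and smooth on all of $S^n_R - \{N\}$. The easiest way to
see that it is a diffeomorphism is to compute its inverse. Solving the two
equations of (3.6) for $\tau$ and $\xi$ gives

$$\xi = \frac{u}{\lambda}, \qquad \tau = R\,\frac{\lambda - 1}{\lambda}. \tag{3.8}$$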

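The point $P = \sigma^{-1}(u)$ is characterized by these equations and the fact that
$P$ is on the sphere. Thus, substituting (3.8) into $|\xi|^2 + \tau^2 = R^2$ gives

$$\frac{|u|^2}{\lambda^2} + R^2\,\frac{(\lambda - 1)^2}{\lambda^2} = R^2,$$

from which we conclude

$$\lambda = \frac{|u|^2 + R^2}{2R^2}.$$
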
FIGURE 3.3. Stereographic projection.


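Inserting this back into (3.8) gives the formula

$$\sigma^{-1}(u) = (\xi, \tau) = \left( \frac{2R^2 u}{|u|^2 + R^2},\ R\,\frac{|u|^2 - R^2}{|u|^2 + R^2} \right), \tag{3.9}$$

which by construction maps $\mathbb{R}^n$ back to $S^n_R - \{N\}$ and shows that $\sigma$ is a
diffeomorphism.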
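
Formulas (3.7) and (3.9) are easy to check numerically. The sketch below is only a spot check at one sample point, with hypothetical function names `sigma` and `sigma_inv`; points are stored as Python lists with the $\tau$ coordinate last.

```python
def sigma(P, R):
    """Stereographic projection (3.7): P = (xi, tau) -> u = R*xi / (R - tau)."""
    xi, tau = P[:-1], P[-1]
    return [R * x / (R - tau) for x in xi]

def sigma_inv(u, R):
    """Inverse map (3.9): u -> (xi, tau) on the sphere of radius R."""
    s = sum(x * x for x in u)  # |u|^2
    xi = [2 * R * R * x / (s + R * R) for x in u]
    tau = R * (s - R * R) / (s + R * R)
    return xi + [tau]

R = 2.0
u = [0.3, -1.2]
P = sigma_inv(u, R)                 # a point of the sphere other than N
radius_sq = sum(x * x for x in P)   # should be close to R^2
u_back = sigma(P, R)                # should be close to u
```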


Lemma 3.4. Stereographic projection is a conformal equivalence between
$S^n_R - \{N\}$ and $\mathbb{R}^n$.


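Proof. The inverse map $\sigma^{-1}$ is a local parametrization, so we will use it to
compute the pullback metric. Consider an arbitrary point $q \in \mathbb{R}^n$ and a
vector $V \in T_q\mathbb{R}^n$, and compute

$$(\sigma^{-1})^*\mathring g_R(V, V) = \mathring g_R(\sigma^{-1}_*V, \sigma^{-1}_*V) = \bar g(\sigma^{-1}_*V, \sigma^{-1}_*V),$$

where $\bar g$ denotes the Euclidean metric on $\mathbb{R}^{n+1}$. Writing $V = V^i\partial_i$ and
$\sigma^{-1}(u) = (\xi(u), \tau(u))$, the usual formula for the push-forward of a vector
can be written

$$\sigma^{-1}_*V = V^i\frac{\partial \xi^j}{\partial u^i}\frac{\partial}{\partial \xi^j} + V^i\frac{\partial \tau}{\partial u^i}\frac{\partial}{\partial \tau} = V\xi^j\,\frac{\partial}{\partial \xi^j} + V\tau\,\frac{\partial}{\partial \tau}.$$

Now

$$V\xi^j = V\!\left( \frac{2R^2 u^j}{|u|^2 + R^2} \right) = \frac{2R^2 V^j}{|u|^2 + R^2} - \frac{4R^2 u^j \langle V, u \rangle}{(|u|^2 + R^2)^2};$$

$$V\tau = V\!\left( R\,\frac{|u|^2 - R^2}{|u|^2 + R^2} \right) = \frac{2R\langle V, u \rangle}{|u|^2 + R^2} - \frac{2R(|u|^2 - R^2)\langle V, u \rangle}{(|u|^2 + R^2)^2} = \frac{4R^3 \langle V, u \rangle}{(|u|^2 + R^2)^2},$$

where we have used the notation $V(|u|^2) = 2\sum_k V^k u^k = 2\langle V, u \rangle$. Therefore,

$$\begin{aligned}
\bar g(\sigma^{-1}_*V, \sigma^{-1}_*V) &= \sum_{j=1}^{n} (V\xi^j)^2 + (V\tau)^2 \\
&= \frac{4R^4|V|^2}{(|u|^2 + R^2)^2} - \frac{16R^4\langle V, u \rangle^2}{(|u|^2 + R^2)^3} + \frac{16R^4|u|^2\langle V, u \rangle^2}{(|u|^2 + R^2)^4} + \frac{16R^6\langle V, u \rangle^2}{(|u|^2 + R^2)^4} \\
&= \frac{4R^4|V|^2}{(|u|^2 + R^2)^2}.
\end{aligned}$$

In other words,

$$(\sigma^{-1})^*\mathring g_R = \frac{4R^4}{(|u|^2 + R^2)^2}\,\bar g, \tag{3.10}$$

where now $\bar g$ represents the Euclidean metric on $\mathbb{R}^n$, and so $\sigma$ is a conformal
equivalence. □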
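
The conformal factor obtained in this proof can be spot-checked numerically. The following sketch, which is an illustration and not part of the proof, approximates the push-forward $\sigma^{-1}_*V$ by a central difference along $V$ and compares its squared Euclidean length with $4R^4|V|^2/(|u|^2 + R^2)^2$. The names `sigma_inv` and `pushforward`, the step size, and the sample values are assumptions chosen for illustration.

```python
def sigma_inv(u, R):
    """Inverse stereographic projection (3.9)."""
    s = sum(x * x for x in u)
    return [2 * R * R * x / (s + R * R) for x in u] + [R * (s - R * R) / (s + R * R)]

def pushforward(F, u, V, h=1e-6):
    """Central-difference approximation of F_* V at the point u."""
    up = [a + h * b for a, b in zip(u, V)]
    um = [a - h * b for a, b in zip(u, V)]
    return [(p - m) / (2 * h) for p, m in zip(F(up), F(um))]

R = 1.5
u = [0.4, -0.7, 1.1]
V = [1.0, 2.0, -0.5]

W = pushforward(lambda x: sigma_inv(x, R), u, V)
lhs = sum(w * w for w in W)            # Euclidean length squared of the push-forward
s = sum(x * x for x in u)
rhs = 4 * R**4 * sum(v * v for v in V) / (s + R * R) ** 2
```

The two quantities `lhs` and `rhs` should agree up to the finite-difference error.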


It follows immediately from this lemma that the sphere is locally conformally
flat; i.e., each point $p \in S^n_R$ has a neighborhood that is conformally
equivalent to an open set in $\mathbb{R}^n$. Stereographic projection gives such an
equivalence for a neighborhood of any point except the north pole; applying
a suitable rotation and then stereographic projection (or stereographic
projection from the south pole), we get such an equivalence for a neighborhood
of the north pole as well.


Hyperbolic Spaces

Our third class of model Riemannian manifolds is the hyperbolic spaces
of dimension $n$. For each $R > 0$ we will describe a homogeneous, isotropic
Riemannian manifold $\mathbb{H}^n_R$, called hyperbolic space of radius $R$, analogous to
the sphere of radius $R$. The special case $R = 1$ is denoted $\mathbb{H}^n$ and is called
simply hyperbolic space. There are three equivalent models of the hyperbolic
spaces, each of which is useful in certain contexts. We'll introduce all of
them and show that they are isometric.

Proposition 3.5. For any fixed $R > 0$, the following Riemannian manifolds
are all mutually isometric.

(a) (Hyperboloid model) $\mathbb{H}^n_R$ is the "upper sheet" $\{\tau > 0\}$ of the two-sheeted
hyperboloid in $\mathbb{R}^{n+1}$ defined in coordinates $(\xi^1, \dots, \xi^n, \tau)$ by
the equation $\tau^2 - |\xi|^2 = R^2$, with the metric

$$h^1_R = \iota^* m,$$

where $\iota\colon \mathbb{H}^n_R \to \mathbb{R}^{n+1}$ is inclusion, and $m$ is the Minkowski metric
(3.5) on $\mathbb{R}^{n+1}$.

(b) (Poincaré ball model) $\mathbb{B}^n_R$ is the ball of radius $R$ in $\mathbb{R}^n$, with the
metric given in coordinates $(u^1, \dots, u^n)$ by

$$h^2_R = 4R^4\,\frac{(du^1)^2 + \cdots + (du^n)^2}{(R^2 - |u|^2)^2}.$$

(c) (Poincaré half-space model) $\mathbb{U}^n_R$ is the upper half-space in $\mathbb{R}^n$
defined in coordinates $(x^1, \dots, x^{n-1}, y)$ by $\{y > 0\}$, with the metric

$$h^3_R = R^2\,\frac{(dx^1)^2 + \cdots + (dx^{n-1})^2 + dy^2}{y^2}.$$


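Proof. We begin by giving a geometric construction of a diffeomorphism

$$\pi\colon \mathbb{H}^n_R \to \mathbb{B}^n_R$$

from the hyperboloid to the ball, which we call hyperbolic stereographic
projection, and which turns out to be an isometry between the two metrics
given in (a) and (b).

Let $S \in \mathbb{R}^{n+1}$ denote the point $S = (0, \dots, 0, -R)$. For any
$P = (\xi^1, \dots, \xi^n, \tau) \in \mathbb{H}^n_R \subset \mathbb{R}^{n+1}$, set $\pi(P) = u \in \mathbb{B}^n_R$, where
$U = (u, 0) \in \mathbb{R}^{n+1}$ is the point where the line through $S$ and $P$ intersects the
hyperplane $\{\tau = 0\}$ (Figure 3.4). $U$ is characterized by
$\overrightarrow{SU} = \lambda\,\overrightarrow{SP}$ for some nonzero scalar $\lambda$, or

$$\begin{aligned} u &= \lambda\xi, \\ R &= \lambda(\tau + R). \end{aligned} \tag{3.11}$$
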
FIGURE 3.4. Hyperbolic stereographic projection.


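These equations can be solved in the same manner as in the spherical case
to yield

$$\pi(\xi, \tau) = u = \frac{R\,\xi}{R + \tau},$$

and its inverse map

$$\pi^{-1}(u) = (\xi, \tau) = \left( \frac{2R^2 u}{R^2 - |u|^2},\ R\,\frac{R^2 + |u|^2}{R^2 - |u|^2} \right).$$

We will show that $(\pi^{-1})^* h^1_R = h^2_R$. As before, let $V \in T_q\mathbb{B}^n_R$ and compute

$$(\pi^{-1})^* h^1_R(V, V) = h^1_R(\pi^{-1}_*V, \pi^{-1}_*V) = m(\pi^{-1}_*V, \pi^{-1}_*V).$$
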
The computation proceeds just as before. In this case, the relevant equations
are

$$V\xi^j = \frac{2R^2 V^j}{R^2 - |u|^2} + \frac{4R^2 u^j \langle V, u \rangle}{(R^2 - |u|^2)^2};$$

$$V\tau = \frac{4R^3 \langle V, u \rangle}{(R^2 - |u|^2)^2};$$

$$m(\pi^{-1}_*V, \pi^{-1}_*V) = \sum_{j=1}^{n} (V\xi^j)^2 - (V\tau)^2 = \frac{4R^4|V|^2}{(R^2 - |u|^2)^2} = h^2_R(V, V).$$

Incidentally, this argument also shows that $h^1_R$ is positive definite, and
thus is indeed a Riemannian metric, a fact that was not evident from the
defining formula due to the fact that $m$ is not positive definite.
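
The same kind of numerical spot check used for the sphere applies here. The sketch below assumes the formula for $\pi^{-1}$ given above and the sign convention of (3.5) for $m$; it approximates $\pi^{-1}_*V$ by a central difference and compares $m(\pi^{-1}_*V, \pi^{-1}_*V)$ with $4R^4|V|^2/(R^2 - |u|^2)^2$. The function names and sample values are illustrative assumptions.

```python
def pi_inv(u, R):
    """Inverse hyperbolic stereographic projection: ball -> hyperboloid, tau last."""
    s = sum(x * x for x in u)  # |u|^2, assumed to be less than R^2
    return [2 * R * R * x / (R * R - s) for x in u] + [R * (R * R + s) / (R * R - s)]

def minkowski_sq(W):
    """m(W, W) = sum of squared xi-components minus the squared tau-component."""
    return sum(w * w for w in W[:-1]) - W[-1] ** 2

def pushforward(F, u, V, h=1e-6):
    """Central-difference approximation of F_* V at the point u."""
    up = [a + h * b for a, b in zip(u, V)]
    um = [a - h * b for a, b in zip(u, V)]
    return [(p - m) / (2 * h) for p, m in zip(F(up), F(um))]

R = 1.5
u = [0.4, -0.7]          # a point of the ball of radius R
V = [1.0, 2.0]

P = pi_inv(u, R)
on_hyperboloid = P[-1] ** 2 - sum(x * x for x in P[:-1])   # should be close to R^2
lhs = minkowski_sq(pushforward(lambda x: pi_inv(x, R), u, V))
s = sum(x * x for x in u)
rhs = 4 * R**4 * sum(v * v for v in V) / (R * R - s) ** 2
```

Although $m$ itself is indefinite, the computed value `lhs` is positive, in line with the remark that $h^1_R$ is positive definite.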

Next we consider the Poincaré half-space model, by constructing an explicit
diffeomorphism

$$\kappa\colon \mathbb{B}^n_R \to \mathbb{U}^n_R.$$

In this case it is more convenient to write the coordinates on the ball as
$(u^1, \dots, u^{n-1}, v) = (u, v)$. In the 2-dimensional case, $\kappa$ is easy to write
down in complex notation $w = u + iv$ and $z = x + iy$. It is a variant of the
classical Cayley transform:

$$\kappa(w) = z = -iR\,\frac{w + iR}{w - iR}. \tag{3.12}$$

It is shown in elementary complex analysis courses that this is a complex-analytic
diffeomorphism taking $\mathbb{B}^2_R$ onto $\mathbb{U}^2_R$. Separating $z$ into real and
imaginary parts, this can also be written in real terms as

$$\kappa(u, v) = (x, y) = \left( \frac{2R^2 u}{|u|^2 + (v - R)^2},\ R\,\frac{R^2 - |u|^2 - v^2}{|u|^2 + (v - R)^2} \right).$$

This same formula makes sense in any dimension, and obviously maps the
ball $\{|u|^2 + v^2 < R^2\}$ into the upper half-space. It is straightforward to
check that its inverse is

$$\kappa^{-1}(x, y) = (u, v) = \left( \frac{2R^2 x}{|x|^2 + (y + R)^2},\ R\,\frac{|x|^2 + |y|^2 - R^2}{|x|^2 + (y + R)^2} \right),$$

so $\kappa$ is a diffeomorphism, called the generalized Cayley transform. The
verification that $\kappa^* h^3_R = h^2_R$ is basically a long calculation, and is left to
the reader. □
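
The two real formulas for $\kappa$ and $\kappa^{-1}$ can be spot-checked numerically. The sketch below verifies only that a sample point of the ball lands in the upper half-space and that the two maps undo each other there; it does not address the isometry claim, which is left as an exercise in the text. The function names `kappa` and `kappa_inv` and the sample values are illustrative assumptions.

```python
def kappa(u, v, R):
    """Generalized Cayley transform: (u, v) in the ball -> (x, y) in the half-space."""
    s = sum(a * a for a in u)              # |u|^2
    d = s + (v - R) ** 2
    return [2 * R * R * a / d for a in u], R * (R * R - s - v * v) / d

def kappa_inv(x, y, R):
    """Inverse transform: (x, y) in the half-space -> (u, v) in the ball."""
    s = sum(a * a for a in x)              # |x|^2
    d = s + (y + R) ** 2
    return [2 * R * R * a / d for a in x], R * (s + y * y - R * R) / d

R = 2.0
u, v = [0.5, -0.3], 1.0                    # |u|^2 + v^2 = 1.34 < R^2
x, y = kappa(u, v, R)                      # y should be positive
u_back, v_back = kappa_inv(x, y, R)        # should recover (u, v)
```

For the center of the ball, the formula gives $\kappa(0, 0) = (0, R)$.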

Exercise 3.11. Prove that $\kappa^* h^3_R = h^2_R$. Here are three different ways you
might wish to proceed:

(i) Compute $h^3_R(\kappa_*V, \kappa_*V)$ directly, as in the proof of Proposition 3.5.

(ii) Show that $\kappa$ is the restriction to the ball of the map $\sigma \circ \rho \circ \sigma^{-1}$, where
$\sigma\colon S^n_R - \{N\} \to \mathbb{R}^n$ is stereographic projection and $\rho\colon S^n_R \to S^n_R$ is the 90°
rotation

$$\rho(\xi^1, \dots, \xi^{n-1}, \xi^n, \tau) = (\xi^1, \dots, \xi^{n-1}, -\tau, \xi^n),$$

taking the hemisphere $\{\tau < 0\}$ to the hemisphere $\{\xi^n > 0\}$. This
shows that $\kappa$ is a conformal map, and therefore it suffices to show that
$h^3_R(\kappa_*V, \kappa_*V) = h^2_R(V, V)$ for a single strategically chosen vector $V$ at
each point. Do this for $V = \partial/\partial v$.

(iii) If you know some complex analysis, first do the 2-dimensional case
using the complex form (3.12) of $\kappa$: Compute the pullback in complex
notation, by noting that

$$h^3_R = R^2\,\frac{dz\,d\bar z}{(\operatorname{Im} z)^2}, \qquad h^2_R = 4R^4\,\frac{dw\,d\bar w}{(R^2 - |w|^2)^2},$$

and using the fact that a holomorphic diffeomorphism $z = F(w)$ is a
conformal map with $F^*(dz\,d\bar z) = |F'(w)|^2\,dw\,d\bar w$. Then show that the
computation of $h^3_R(\kappa_*V, \kappa_*V)$ in higher dimensions can be reduced to
the 2-dimensional case, by conjugating $\kappa$ with a suitable orthogonal
transformation in $n - 1$ variables.


We often use the generic notation $\mathbb{H}^n_R$ to refer to any one of the manifolds
of Proposition 3.5, and $h_R$ to refer to the corresponding metric, using
whichever model is most convenient for the application we have in mind. For
example, the form of the metric in either the ball model or the half-space
model makes it clear that the hyperbolic metric is locally conformally flat;
indeed, in either model, the identity map gives a global conformal equivalence
with an open subset of Euclidean space.

The symmetries of $\mathbb{H}^n_R$ are most easily seen in the hyperboloid model. Let
$O(n, 1)$ denote the group of linear maps from $\mathbb{R}^{n+1}$ to itself that preserve
the Minkowski metric. (This is called the Lorentz group in the physics
literature.) Note that each element of $O(n, 1)$ preserves the set
$\{\tau^2 - |\xi|^2 = R^2\}$, which has two components determined by $\{\tau > 0\}$ and
$\{\tau < 0\}$. We let $O_+(n, 1)$ denote the subgroup of $O(n, 1)$ consisting of maps
that take the component $\{\tau > 0\}$ to itself. Clearly $O_+(n, 1)$ preserves $\mathbb{H}^n_R$,
and because it preserves $m$ it acts on $\mathbb{H}^n_R$ as isometries.

Proposition 3.6. $O_+(n, 1)$ acts transitively on the set of orthonormal
bases on $\mathbb{H}^n_R$, and therefore $\mathbb{H}^n_R$ is homogeneous and isotropic.

Proof. The argument is entirely analogous to the proof of Proposition 3.3,
so we give only a sketch. If $p \in \mathbb{H}^n_R$ and $\{E_i\}$ is an orthonormal basis for
$T_p\mathbb{H}^n_R$, an easy computation shows that $\{E_1, \dots, E_n, E_{n+1} = p/R\}$ is a
basis for $\mathbb{R}^{n+1}$ such that $m$ has the following expression in terms of the
dual basis:

$$m = (\varphi^1)^2 + \cdots + (\varphi^n)^2 - (\varphi^{n+1})^2.$$

It follows easily that the matrix whose columns are the $E_i$'s is an element
of $O_+(n, 1)$ sending $N = (0, \dots, 0, R)$ to $p$ and $\partial_i$ to $E_i$ (Figure 3.5). □

FIGURE 3.5. $O_+(n, 1)$ acts transitively on orthonormal bases on $\mathbb{H}^n_R$.


Exercise 3.12. The spherical and hyperbolic metrics come in families $\mathring g_R$,
$h_R$, parametrized by a positive real number $R$. We could have also defined
a family of metrics on $\mathbb{R}^n$ by

$$\bar g_R = R^2\,\delta_{ij}\,dx^i\,dx^j.$$

Why did we not bother?


Problems


3-1. Suppose $(\tilde M, \tilde g)$ is a Riemannian $m$-manifold, $M \subset \tilde M$ is an embedded
$n$-dimensional submanifold, and $g$ is the induced Riemannian metric
on $M$. For any point $p \in M$, show that there is a neighborhood $\tilde U$ of $p$
in $\tilde M$ and a smooth orthonormal frame $(E_1, \dots, E_m)$ on $\tilde U$ such that
$(E_1, \dots, E_n)$ form an orthonormal basis for $T_qM$ at each $q \in \tilde U \cap M$.
Any such frame is called an adapted orthonormal frame. [Hint: Apply
the Gram-Schmidt algorithm to the coordinate frame $\{\partial_i\}$ in slice
coordinates.]

3-2. Suppose $g$ is a pseudo-Riemannian metric on an $n$-manifold $M$. For
any $p \in M$, show there is a smooth local frame $(E_1, \dots, E_n)$ defined
in a neighborhood of $p$ such that $g$ can be written locally in the form
(3.4). Conclude that the index of $g$ is constant on each component of $M$.


3-3. Let $(M, g)$ be an oriented Riemannian manifold with volume element
$dV$. The divergence operator $\operatorname{div}\colon \mathcal{T}(M) \to C^\infty(M)$ is defined by

$$d(i_X\,dV) = (\operatorname{div} X)\,dV,$$

where $i_X$ denotes interior multiplication by $X$: for any $k$-form $\omega$, $i_X\omega$
is the $(k-1)$-form defined by

$$i_X\omega(V_1, \dots, V_{k-1}) = \omega(X, V_1, \dots, V_{k-1}).$$

(a) Suppose $M$ is a compact, oriented Riemannian manifold with
boundary. Prove the following divergence theorem for $X \in \mathcal{T}(M)$:

$$\int_M \operatorname{div} X\,dV = \int_{\partial M} \langle X, N \rangle\,d\tilde V,$$

where $N$ is the outward unit normal to $\partial M$ and $d\tilde V$ is the Riemannian
volume element of the induced metric on $\partial M$.

(b) Show that the divergence operator satisfies the following product
rule for a smooth function $u \in C^\infty(M)$:

$$\operatorname{div}(uX) = u\,\operatorname{div} X + \langle \operatorname{grad} u, X \rangle,$$

and deduce the following "integration by parts" formula:

$$\int_M \langle \operatorname{grad} u, X \rangle\,dV = -\int_M u\,\operatorname{div} X\,dV + \int_{\partial M} u\,\langle X, N \rangle\,d\tilde V.$$

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3-4. Let (M, g) be a compact, connected, oriented Riemannian manifold 
with boundary. For u G the Laplacian of u, denoted Aw, is 

defined to be the function Au = div(gradM). A function u G C°°(M) 
is said to be harmonic if Au = 0. 


(a) Prove Green’s identities: 


uAvdV + / (grad u, grad u) dP = / uNvdV. 


IM 


IM 


IdM 


{uAv — vAu)dV= / {uNv — vNu)dV. 


IM 


IdM 


(b) If dM yf 0, and u, v are harmonic functions on M whose restric¬ 
tions to dM agree, show that u = v. 

(c) If dM = 0, show that the only harmonic functions on M are the 
constants. 


3-5. Let $M$ be a compact oriented Riemannian manifold (without boundary).
A real number $\lambda$ is called an eigenvalue of the Laplacian if
there exists a smooth function $u$ on $M$, not identically zero, such
that $\Delta u = \lambda u$. In this case, $u$ is called an eigenfunction corresponding
to $\lambda$.

(a) Prove that 0 is an eigenvalue of $\Delta$, and that all other eigenvalues
are strictly negative.

(b) If $u$ and $v$ are eigenfunctions corresponding to distinct eigenvalues,
show that $\int_M uv\,dV = 0$.

3-6. Consider $\mathbb{R}^n$ as a Riemannian manifold with the Euclidean metric.

(a) Let $E(n)$ be the set of $(n+1)\times(n+1)$ real matrices of the form

    $$\begin{pmatrix} A & b \\ 0 & 1 \end{pmatrix},$$

where $A \in O(n)$ and $b \in \mathbb{R}^n$ (considered as a column vector).
Show that $E(n)$ is a closed Lie subgroup of $GL(n+1,\mathbb{R})$, called
the Euclidean group or the group of rigid motions.

(b) Define a map $E(n)\times\mathbb{R}^n \to \mathbb{R}^n$ by identifying $\mathbb{R}^n$ with the
subset

    $$S = \{(x,1) \in \mathbb{R}^{n+1} : x \in \mathbb{R}^n\}$$

of $\mathbb{R}^{n+1}$ and restricting the linear action of $E(n)$ on $\mathbb{R}^{n+1}$ to $S$.
Show that this is a smooth action of $E(n)$ on $\mathbb{R}^n$ by isometries
of the Euclidean metric.

(c) Show that $E(n)$ acts transitively on $\mathbb{R}^n$, and takes any orthonormal
basis to any other one, so Euclidean space is homogeneous
and isotropic.

3-7. Let $\mathbb{U}^2$ denote the hyperbolic plane, i.e., the upper half-plane in $\mathbb{R}^2$
with the metric $h = (dx^2 + dy^2)/y^2$. Let $SL(2,\mathbb{R})$ denote the group
of $2\times 2$ real matrices of determinant 1.

(a) Considering $\mathbb{U}^2$ as a subset of the complex plane with coordinate
$z = x + iy$, let

    $$A\cdot z = \frac{az+b}{cz+d}, \qquad A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in SL(2,\mathbb{R}).$$

Show that this defines a smooth action of $SL(2,\mathbb{R})$ on $\mathbb{U}^2$ by
isometries of the hyperbolic metric.

(b) We have seen that $O_+(2,1)$ also acts on $\mathbb{U}^2$ by isometries. Show
that $SL(2,\mathbb{R})/\{\pm I\} \cong SO_+(2,1)$, where $SO_+(2,1) = O_+(2,1)\cap SL(3,\mathbb{R})$.

3-8. Suppose $\tilde{M}$ and $M$ are smooth manifolds, and $\pi\colon \tilde{M} \to M$ is a surjective
submersion. For any $y \in M$, the fiber over $y$, denoted $\tilde{M}_y$, is
the inverse image $\pi^{-1}(y) \subset \tilde{M}$; it is a closed, embedded submanifold
by the implicit function theorem. If $\tilde{M}$ has a Riemannian metric $\tilde{g}$,
at each point $x \in \tilde{M}$ the tangent space $T_x\tilde{M}$ decomposes into an
orthogonal direct sum

    $$T_x\tilde{M} = H_x \oplus V_x,$$

where $V_x := \operatorname{Ker}\pi_* = T_x\tilde{M}_{\pi(x)}$ is the vertical space and $H_x := V_x^\perp$ is
the horizontal space. If $g$ is a Riemannian metric on $M$, $\pi$ is said to
be a Riemannian submersion if $\tilde{g}(X,Y) = g(\pi_*X,\pi_*Y)$ whenever $X$
and $Y$ are horizontal.

(a) Show that any vector field $W$ on $\tilde{M}$ can be written uniquely as
$W = W^H + W^V$, where $W^H$ is horizontal, $W^V$ is vertical, and
both $W^H$ and $W^V$ are smooth.

(b) If $X$ is a vector field on $M$, show there is a unique smooth
horizontal vector field $\tilde{X}$ on $\tilde{M}$, called the horizontal lift of $X$,
that is $\pi$-related to $X$. (This means $\pi_*\tilde{X}_q = X_{\pi(q)}$ for each $q \in \tilde{M}$.)

(c) Let $G$ be a Lie group acting smoothly on $\tilde{M}$ by isometries of $\tilde{g}$,
and suppose that $\pi\circ\varphi = \pi$ for all $\varphi \in G$ and that $G$ acts transitively
on each fiber $\tilde{M}_y$. Show that there is a unique Riemannian
metric $g$ on $M$ such that $\pi$ is a Riemannian submersion. [Hint:
First show that $\varphi_*V_x = V_{\varphi(x)}$ for any $\varphi \in G$.]

3-9. The complex projective space of dimension $n$, denoted $\mathbb{CP}^n$, is defined
as the set of 1-dimensional complex subspaces of $\mathbb{C}^{n+1}$. Let
$\pi\colon \mathbb{C}^{n+1} - \{0\} \to \mathbb{CP}^n$ denote the quotient map.

(a) Show that $\mathbb{CP}^n$ can be uniquely given the structure of a smooth,
compact, real $2n$-dimensional manifold on which the Lie group
$U(n+1)$ acts smoothly and transitively.

(b) Show that the restriction of $\pi$ to $S^{2n+1} \subset \mathbb{C}^{n+1}$ is a surjective
submersion.

(c) Using Problem 3-8, show that the round metric on $S^{2n+1}$ descends
to a homogeneous and isotropic Riemannian metric on
$\mathbb{CP}^n$, called the Fubini-Study metric.

3-10. Let $G$ be a Lie group with Lie algebra $\mathfrak{g}$. A Riemannian metric $g$ on $G$
is said to be left-invariant if it is invariant under all left translations:
$L_p^*g = g$ for all $p \in G$. Similarly, $g$ is right-invariant if it is invariant
under all right translations, and bi-invariant if it is both left- and
right-invariant.

(a) Show that a metric $g$ is left-invariant if and only if the coefficients
$g_{ij} := g(X_i,X_j)$ of $g$ with respect to any left-invariant frame
$\{X_i\}$ are constants.

(b) Show that the restriction map $g \mapsto g|_{T_eG}$ gives a bijection between
left-invariant metrics on $G$ and inner products on $\mathfrak{g}$.

3-11. Suppose $G$ is a compact, connected Lie group with a left-invariant
metric $g$, and let $dV$ denote the Riemannian volume element of $g$.
Show that $dV$ is bi-invariant. [Hint: Show that $R_p^*dV$ is left-invariant
and positively oriented, and is therefore equal to $\psi(p)\,dV$ for some
positive number $\psi(p)$. Show that $\psi\colon G \to \mathbb{R}^+$ is a Lie group homomorphism,
so its image is a compact subgroup of $\mathbb{R}^+$.]

3-12. If $G$ is a Lie group and $p \in G$, conjugation by $p$ gives a Lie
group automorphism $C_p\colon G \to G$, called an inner automorphism,
by $C_p(q) = pqp^{-1}$. Let $\operatorname{Ad}_p := (C_p)_*\colon \mathfrak{g} \to \mathfrak{g}$ be the induced Lie algebra
automorphism. It is easy to check that $C_{p_1}\circ C_{p_2} = C_{p_1p_2}$, so
$\operatorname{Ad}\colon G\times\mathfrak{g} \to \mathfrak{g}$ is a representation of $G$, called the adjoint representation.

(a) Show that an inner product on $\mathfrak{g}$ induces a bi-invariant metric
on $G$ as in Problem 3-10 if and only if it is invariant under the
adjoint representation.

(b) Show that every compact, connected Lie group admits a bi-invariant
Riemannian metric. [Hint: Start with an arbitrary inner
product $\langle\cdot,\cdot\rangle$ on $\mathfrak{g}$ and integrate the function $f$ defined by
$f(p) := \langle\operatorname{Ad}_pX,\operatorname{Ad}_pY\rangle$ over the group. You will need to use the
result of Problem 3-11.]


Connections


Before we can define curvature on Riemannian manifolds, we need to study 
geodesics, the Riemannian generalizations of straight lines. It is tempting 
to define geodesics as curves that minimize length, at least between nearby 
points. However, this property turns out to be technically difficult to work 
with as a definition, so instead we’ll choose a different property of straight 
lines and generalize that. 

A curve in Euclidean space is a straight line if and only if its acceleration 
is identically zero. This is the property that we choose to take as a defining 
property of geodesics on a Riemannian manifold. To make sense of this 
idea, we’re going to have to introduce a new object on manifolds, called a
connection: essentially a coordinate-invariant set of rules for taking
directional derivatives of vector fields.

We begin this chapter by examining more closely the problem of finding 
an invariant interpretation for the acceleration of a curve, as a way to 
motivate the definitions that follow. We then give a rather general definition 
of a connection, in terms of directional derivatives of sections of vector 
bundles. The special case in which the vector bundle is the tangent bundle 
is called a “linear connection,” and it is on this case that we focus most 
of our attention. After deriving some basic properties of connections, we 
show how to use one to differentiate vector fields along curves, to define 
geodesics, and “parallel translate” vector fields along curves.

FIGURE 4.1. Euclidean coordinates. FIGURE 4.2. Polar coordinates.


The Problem of Differentiating Vector Fields 

To see why we need a new kind of differentiation operator, consider a
submanifold $M \subset \mathbb{R}^n$ with the induced Riemannian metric, and a smooth
curve $\gamma$ lying entirely in $M$. We want to think of a geodesic as a curve in $M$
that is “as straight as possible.” An intuitively plausible way to measure
straightness is to compute the Euclidean acceleration $\ddot\gamma(t)$ as usual, and
orthogonally project $\ddot\gamma(t)$ onto the tangent space $T_{\gamma(t)}M$. This yields a
vector $\ddot\gamma(t)^\top$ tangent to $M$, the tangential acceleration of $\gamma$. We could then
define a geodesic as a curve in $M$ whose tangential acceleration is zero.
This definition is easily seen to be invariant under rigid motions of $\mathbb{R}^n$,
although at this point there is little reason to believe that it is an intrinsic
invariant of $M$ (one that depends only on the Riemannian geometry of $M$
with its induced metric).

On an abstract Riemannian manifold, for which there is no “ambient
Euclidean space” in which to differentiate, this technique is not available.
Thus we have to find some way to make sense of the acceleration of a curve
in an abstract manifold. Let $\gamma\colon (a,b) \to M$ be such a curve. As you know
from your study of smooth manifold theory, the velocity vector $\dot\gamma(t)$ has
a coordinate-independent meaning for each $t \in (a,b)$, and its expression in
any coordinate system matches the usual notion of velocity of a curve in
$\mathbb{R}^n$: $\dot\gamma(t) = (\dot\gamma^1(t),\dots,\dot\gamma^n(t))$. However, unlike the velocity, the acceleration
vector has no such coordinate-invariant interpretation. For example, consider
the parametrized circle in the plane given in Euclidean coordinates by
$(x(t),y(t)) = (\cos t,\sin t)$ (Figure 4.1). Its acceleration at time $t$ is the unit
vector $(\ddot x(t),\ddot y(t)) = (-\cos t,-\sin t)$. But in polar coordinates, the same
curve is described by $(r(t),\theta(t)) = (1,t)$ (Figure 4.2). In these coordinates,
the acceleration vector is $(\ddot r(t),\ddot\theta(t)) = (0,0)$!

The problem is this: If we wanted to make sense of $\ddot\gamma(t_0)$ by differentiating
$\dot\gamma(t)$ with respect to $t$, we would have to write a difference quotient
involving the vectors $\dot\gamma(t)$ and $\dot\gamma(t_0)$; but these live in different vector spaces
($T_{\gamma(t)}M$ and $T_{\gamma(t_0)}M$ respectively), so it doesn’t make sense to subtract
them (Figure 4.3).

FIGURE 4.3. $\dot\gamma(t_0)$ and $\dot\gamma(t)$ lie in different vector spaces.
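The circle example can be checked numerically. The following is a minimal sketch, not part of the text's development: it differentiates the coordinate functions of the same curve twice in each chart by central differences. The helper name `second_derivative` and the sample time `t0` are illustrative choices.

```python
import math

def second_derivative(f, t, h=1e-4):
    """Central-difference approximation to f''(t)."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / (h * h)

# The unit circle in Euclidean coordinates (x, y) and in polar coordinates (r, theta).
euclidean = (math.cos, math.sin)
polar = (lambda t: 1.0, lambda t: t)

t0 = 0.5
acc_euclidean = [second_derivative(f, t0) for f in euclidean]  # close to (-cos t0, -sin t0)
acc_polar = [second_derivative(f, t0) for f in polar]          # close to (0, 0)
```

The two lists disagree even though they describe the same curve, which is exactly the failure of coordinate invariance described above.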

The velocity vector $\dot\gamma(t)$ is an example of a “vector field along a curve,” a
concept for which we will give a rigorous definition presently. To interpret
the acceleration of a curve in a manifold, what we need is some coordinate-invariant
way to differentiate vector fields along curves. To do so, we need a
way to compare values of the vector field at different points, or, intuitively,
to “connect” nearby tangent spaces. This is where a connection comes in:
it will be an additional piece of data on a manifold, a rule for computing
directional derivatives of vector fields.


Connections 

It turns out to be easiest to define a connection first as a way of differentiating
sections of vector bundles. Later we will adapt the definition to the
case of vector fields along curves.

Let $\pi\colon E \to M$ be a vector bundle over a manifold $M$, and let $\mathcal{E}(M)$
denote the space of smooth sections of $E$. A connection in $E$ is a map

    $$\nabla\colon \mathcal{T}(M)\times\mathcal{E}(M) \to \mathcal{E}(M),$$

written $(X,Y) \mapsto \nabla_XY$, satisfying the following properties:

(a) $\nabla_XY$ is linear over $C^\infty(M)$ in $X$:

    $$\nabla_{fX_1+gX_2}Y = f\nabla_{X_1}Y + g\nabla_{X_2}Y \quad\text{for } f,g \in C^\infty(M);$$

(b) $\nabla_XY$ is linear over $\mathbb{R}$ in $Y$:

    $$\nabla_X(aY_1+bY_2) = a\nabla_XY_1 + b\nabla_XY_2 \quad\text{for } a,b \in \mathbb{R};$$

(c) $\nabla$ satisfies the following product rule:

    $$\nabla_X(fY) = f\nabla_XY + (Xf)Y \quad\text{for } f \in C^\infty(M).$$

The symbol $\nabla$ is read “del,” and $\nabla_XY$ is called the covariant derivative of
$Y$ in the direction of $X$.

Although a connection is defined by its action on global sections, it follows
from the definitions that it is actually a local operator, as the next
lemma shows.

Lemma 4.1. If $\nabla$ is a connection in a bundle $E$, $X \in \mathcal{T}(M)$, $Y \in \mathcal{E}(M)$,
and $p \in M$, then $\nabla_XY|_p$ depends only on the values of $X$ and $Y$ in an
arbitrarily small neighborhood of $p$. More precisely, if $X = \tilde{X}$ and $Y = \tilde{Y}$
on a neighborhood of $p$, then $\nabla_XY|_p = \nabla_{\tilde{X}}\tilde{Y}|_p$.

Proof. First consider $Y$. Replacing $Y$ by $Y - \tilde{Y}$, it clearly suffices to show
that $\nabla_XY|_p = 0$ if $Y$ vanishes on a neighborhood $U$ of $p$.

Choose a bump function $\varphi \in C^\infty(M)$ with support in $U$ such that $\varphi(p) = 1$.
The hypothesis that $Y$ vanishes on $U$ implies that $\varphi Y \equiv 0$ on all of $M$,
so $\nabla_X(\varphi Y) = \nabla_X(0\cdot\varphi Y) = 0\,\nabla_X(\varphi Y) = 0$. Thus for any $X \in \mathcal{T}(M)$, the
product rule gives

    $$0 = \nabla_X(\varphi Y) = (X\varphi)Y + \varphi(\nabla_XY). \qquad (4.1)$$

Now $Y \equiv 0$ on the support of $\varphi$, so the first term on the right is identically
zero. Evaluating (4.1) at $p$ shows that $\nabla_XY|_p = 0$. The argument for $X$ is
similar but easier. □

Exercise 4.1. Complete the proof of Lemma 4.1 by showing that $\nabla_XY$
and $\nabla_{\tilde{X}}Y$ agree at $p$ if $X = \tilde{X}$ on a neighborhood of $p$.

The preceding lemma tells us that we can compute $\nabla_XY$ at $p$ knowing
only the values of $X$ and $Y$ near $p$. In fact, as the next lemma shows, we
need only know the value of $X$ at $p$ itself.

Lemma 4.2. With notation as in Lemma 4.1, $\nabla_XY|_p$ depends only on the
values of $Y$ in a neighborhood of $p$ and the value of $X$ at $p$.

Proof. By linearity, it suffices to show that $\nabla_XY|_p = 0$ whenever $X_p = 0$.
Choose a coordinate neighborhood $U$ of $p$, and write $X = X^i\partial_i$ in
coordinates on $U$, with $X^i(p) = 0$. Then, for any $Y \in \mathcal{E}(M)$,

    $$\nabla_XY|_p = \nabla_{X^i\partial_i}Y|_p = X^i(p)\nabla_{\partial_i}Y|_p = 0.$$

In the first equality, we used Lemma 4.1, which allows us to evaluate $\nabla_XY|_p$
by computing locally in $U$; in the second, we used linearity of $\nabla_XY$ over
$C^\infty(M)$ in $X$. □

Because of Lemma 4.2, we can write $\nabla_{X_p}Y$ in place of $\nabla_XY|_p$. This can
be thought of as a directional derivative of $Y$ at $p$ in the direction of the
vector $X_p$.

Linear Connections 

Now we specialize to connections in the tangent bundle of a manifold. A
linear connection on $M$ is a connection in $TM$, i.e., a map

    $$\nabla\colon \mathcal{T}(M)\times\mathcal{T}(M) \to \mathcal{T}(M)$$

satisfying properties (a)-(c) in the definition of a connection above.

A linear connection on M is often simply called a connection on M. (The 
term affine connection is also frequently used synonymously with linear 
connection, although some authors make a subtle distinction between the 
two terms; cf., for example, [KN63, volume 1].) 

Although the definition of a linear connection resembles the characterization
of $\binom{2}{1}$-tensor fields given by the tensor characterization lemma (Lemma
2.4), a linear connection is not a tensor field because it is not linear over
$C^\infty(M)$ in $Y$, but instead satisfies the product rule.

Next we examine how a linear connection appears in components. Let
$\{E_i\}$ be a local frame for $TM$ on an open subset $U \subset M$. We will usually
work with a coordinate frame $E_i = \partial_i$, but it is useful to start by doing the
computations for more general frames. For any choices of the indices $i$ and
$j$, we can expand $\nabla_{E_i}E_j$ in terms of this same frame:

    $$\nabla_{E_i}E_j = \Gamma^k_{ij}E_k. \qquad (4.2)$$

This defines $n^3$ functions $\Gamma^k_{ij}$ on $U$, called the Christoffel symbols of $\nabla$ with
respect to this frame. The following lemma shows that the action of the
connection $\nabla$ on $U$ is completely determined by its Christoffel symbols.

Lemma 4.3. Let $\nabla$ be a linear connection, and let $X,Y \in \mathcal{T}(U)$ be expressed
in terms of a local frame by $X = X^iE_i$, $Y = Y^jE_j$. Then

    $$\nabla_XY = (XY^k + X^iY^j\Gamma^k_{ij})E_k. \qquad (4.3)$$

Proof. Just use the defining rules for a connection and compute:

    $$\begin{aligned}
    \nabla_XY &= \nabla_X(Y^jE_j) \\
              &= (XY^j)E_j + Y^j\nabla_{X^iE_i}E_j \\
              &= (XY^j)E_j + X^iY^j\nabla_{E_i}E_j \\
              &= XY^jE_j + X^iY^j\Gamma^k_{ij}E_k.
    \end{aligned}$$

Renaming the dummy index in the first term yields (4.3). □


Existence of Connections

So far, we have studied properties of connections, but have not produced
any, so you might be wondering if they are plentiful or rare. In fact, they are
quite plentiful, as we will show shortly. Let’s begin with a trivial example:
on $\mathbb{R}^n$, define the Euclidean connection by

    $$\overline\nabla_X(Y^j\partial_j) = (XY^j)\partial_j. \qquad (4.4)$$

In other words, $\overline\nabla_XY$ is just the vector field whose components are the
ordinary directional derivatives of the components of $Y$ in the direction $X$.
It is easy to check that this satisfies the required properties for a connection,
and that its Christoffel symbols in standard coordinates are all zero. In fact,
there are many more connections on $\mathbb{R}^n$, or indeed on any manifold covered
by a single coordinate chart; the following lemma shows how to construct
all of them explicitly.

Lemma 4.4. Suppose $M$ is a manifold covered by a single coordinate
chart. There is a one-to-one correspondence between linear connections on
$M$ and choices of $n^3$ smooth functions $\{\Gamma^k_{ij}\}$ on $M$, by the rule

    $$\nabla_XY = (X^i\partial_iY^k + X^iY^j\Gamma^k_{ij})\,\partial_k. \qquad (4.5)$$

Proof. Observe that (4.5) is equivalent to (4.3) when $E_i = \partial_i$ is a coordinate
frame, so for every connection the functions $\{\Gamma^k_{ij}\}$ defined by (4.2) satisfy
(4.5). On the other hand, given $\{\Gamma^k_{ij}\}$, it is easy to see by inspection that
(4.5) is smooth if $X$ and $Y$ are, linear over $\mathbb{R}$ in $Y$, and linear over $C^\infty(M)$
in $X$, so only the product rule requires checking; this is a straightforward
computation left to the reader. □
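As an informal sanity check of (4.5), and not a substitute for the computation left to the reader, the following sketch evaluates the formula numerically on $\mathbb{R}^2$ and compares both sides of the product rule at one point. The functions `gamma`, `X`, `Y`, `f` and the point `p` are arbitrary hypothetical choices; partial derivatives are approximated by central differences.

```python
def partial(f, p, i, h=1e-5):
    """Central-difference approximation to the i-th partial derivative of f at p."""
    hi, lo = list(p), list(p)
    hi[i] += h
    lo[i] -= h
    return (f(hi) - f(lo)) / (2.0 * h)

def nabla(gamma, X, Y, p):
    """Components of nabla_X Y at p computed from formula (4.5).

    gamma(p)[k][i][j] stands for Gamma^k_{ij}(p); X and Y map a point to a
    list of components with respect to the coordinate frame.
    """
    n = len(p)
    Xp, Yp, G = X(p), Y(p), gamma(p)
    out = []
    for k in range(n):
        deriv = sum(Xp[i] * partial(lambda q: Y(q)[k], p, i) for i in range(n))
        corr = sum(Xp[i] * Yp[j] * G[k][i][j] for i in range(n) for j in range(n))
        out.append(deriv + corr)
    return out

# Hypothetical data on R^2: any smooth choice of the 8 functions Gamma^k_{ij} works.
def gamma(p):
    x, y = p
    return [[[x, 1.0], [y * y, 0.0]],
            [[0.0, x * y], [2.0, x + y]]]

X = lambda p: [p[1], 1.0 + p[0]]
Y = lambda p: [p[0] * p[1], p[0] - p[1]]
f = lambda p: p[0] ** 2 + 3.0 * p[1]
fY = lambda p: [f(p) * c for c in Y(p)]

p = [0.4, -1.1]
Xf = sum(X(p)[i] * partial(f, p, i) for i in range(2))
lhs = nabla(gamma, X, fY, p)                                   # nabla_X (fY)
rhs = [Xf * Y(p)[k] + f(p) * nabla(gamma, X, Y, p)[k] for k in range(2)]
```

Up to discretization error, `lhs` and `rhs` agree for every choice of `gamma`, which is the content of the product rule.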

Exercise 4.2. Complete the proof of Lemma 4.4. 

Proposition 4.5. Every manifold admits a linear connection. 

Proof. Cover $M$ with coordinate charts $\{U_\alpha\}$; the preceding lemma guarantees
the existence of a connection $\nabla^\alpha$ on each $U_\alpha$. Choosing a partition
of unity $\{\varphi_\alpha\}$ subordinate to $\{U_\alpha\}$, we’d like to patch the $\nabla^\alpha$s together by
the formula

    $$\nabla_XY = \sum_\alpha \varphi_\alpha\nabla^\alpha_XY. \qquad (4.6)$$

Again, it is obvious by inspection that this expression is smooth, linear over
$\mathbb{R}$ in $Y$, and linear over $C^\infty(M)$ in $X$. We have to be a bit careful with
the product rule, though, since a linear combination of connections is not
necessarily a connection. (You can check, for example, that if $\nabla^0$ and $\nabla^1$
are connections, neither $2\nabla^0$ nor $\nabla^0 + \nabla^1$ satisfies the product rule.) By
direct computation, using $\sum_\alpha \varphi_\alpha = 1$ in the third step,

    $$\begin{aligned}
    \nabla_X(fY) &= \sum_\alpha \varphi_\alpha\nabla^\alpha_X(fY) \\
                 &= \sum_\alpha \varphi_\alpha\bigl((Xf)Y + f\nabla^\alpha_XY\bigr) \\
                 &= (Xf)Y + f\sum_\alpha \varphi_\alpha\nabla^\alpha_XY \\
                 &= (Xf)Y + f\nabla_XY.
    \end{aligned}$$

□
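The role of the condition $\sum_\alpha \varphi_\alpha = 1$ can be seen in a one-dimensional toy model. The sketch below assumes a connection on $\mathbb{R}$ of the form $\nabla_XY = X(Y' + \Gamma Y)$ and two weight functions that sum to 1; they stand in for a partition of unity but are not compactly supported. All names are illustrative. The quantity computed is the defect $\nabla_X(fY) - (Xf)Y - f\nabla_XY$.

```python
import math

def d(f, t, h=1e-5):
    """Central-difference approximation to f'(t)."""
    return (f(t + h) - f(t - h)) / (2.0 * h)

def connection(gamma):
    """Connection on R with Christoffel symbol gamma: nabla_X Y = X (Y' + gamma Y)."""
    return lambda X, Y: (lambda t: X(t) * (d(Y, t) + gamma(t) * Y(t)))

def combine(w0, nabla0, w1, nabla1):
    """Pointwise combination w0 * nabla0 + w1 * nabla1 with weight functions w0, w1."""
    return lambda X, Y: (lambda t: w0(t) * nabla0(X, Y)(t) + w1(t) * nabla1(X, Y)(t))

def product_rule_defect(nabla, X, Y, f, t):
    """nabla_X(fY) - ((Xf) Y + f nabla_X Y) at t; it vanishes when the product rule holds."""
    fY = lambda s: f(s) * Y(s)
    return nabla(X, fY)(t) - (X(t) * d(f, t) * Y(t) + f(t) * nabla(X, Y)(t))

nabla0 = connection(lambda t: 0.0)
nabla1 = connection(math.sin)
X = lambda t: 1.0 + t * t
Y, f = math.cos, math.exp

phi0 = lambda t: math.cos(t) ** 2   # weights with phi0 + phi1 = 1
phi1 = lambda t: math.sin(t) ** 2
one = lambda t: 1.0

t = 0.3
patched = combine(phi0, nabla0, phi1, nabla1)
plain_sum = combine(one, nabla0, one, nabla1)
defect_patched = product_rule_defect(patched, X, Y, f, t)   # approximately 0
defect_sum = product_rule_defect(plain_sum, X, Y, f, t)     # approximately X f' Y, not 0
```

The weighted combination passes the check, while the plain sum $\nabla^0 + \nabla^1$ is off by exactly one extra copy of $(Xf)Y$.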


Covariant Derivatives of Tensor Fields 

By definition, a linear connection on M is a way to compute covariant 
derivatives of vector fields. In fact, any linear connection automatically 
induces connections on all tensor bundles over M, and thus gives us a way 
to compute covariant derivatives of any tensor field. 

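Lemma 4.6. Let $\nabla$ be a linear connection on $M$. There is a unique connection
in each tensor bundle $T^k_lM$, also denoted $\nabla$, such that the following
conditions are satisfied.

(a) On $TM$, $\nabla$ agrees with the given connection.

(b) On $T^0M$, $\nabla$ is given by ordinary differentiation of functions:

    $$\nabla_Xf = Xf.$$

(c) $\nabla$ obeys the following product rule with respect to tensor products:

    $$\nabla_X(F\otimes G) = (\nabla_XF)\otimes G + F\otimes(\nabla_XG).$$

(d) $\nabla$ commutes with all contractions: if “tr” denotes the trace on any
pair of indices,

    $$\nabla_X(\operatorname{tr} Y) = \operatorname{tr}(\nabla_XY).$$

This connection satisfies the following additional properties:

(i) $\nabla$ obeys the following product rule with respect to the natural pairing
between a covector field $\omega$ and a vector field $Y$:

    $$\nabla_X\langle\omega,Y\rangle = \langle\nabla_X\omega,Y\rangle + \langle\omega,\nabla_XY\rangle.$$

(ii) For any $F \in \mathcal{T}^k_l(M)$, vector fields $Y_i$, and 1-forms $\omega^j$,

    $$\begin{aligned}
    (\nabla_XF)(\omega^1,\dots,\omega^l,Y_1,\dots,Y_k) ={}& X\bigl(F(\omega^1,\dots,\omega^l,Y_1,\dots,Y_k)\bigr) \\
    &- \sum_{j=1}^{l} F(\omega^1,\dots,\nabla_X\omega^j,\dots,\omega^l,Y_1,\dots,Y_k) \\
    &- \sum_{i=1}^{k} F(\omega^1,\dots,\omega^l,Y_1,\dots,\nabla_XY_i,\dots,Y_k). \qquad (4.7)
    \end{aligned}$$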

Exercise 4.3. Prove Lemma 4.6. [Hint: Show that the defining properties 
imply (i) and (ii); then use these to prove existence.] 


Exercise 4.4. Let $\nabla$ be a linear connection. If $\omega$ is a 1-form and $X$ a
vector field, show that the coordinate expression for $\nabla_X\omega$ is

    $$\nabla_X\omega = \bigl(X^i\partial_i\omega_k - X^i\omega_j\Gamma^j_{ik}\bigr)\,dx^k,$$

where $\{\Gamma^k_{ij}\}$ are the Christoffel symbols of the given connection $\nabla$ on $TM$.
Find a coordinate formula for $\nabla_XF$, where $F \in \mathcal{T}^k_l(M)$ is a tensor field of
any rank.

Because the covariant derivative $\nabla_XY$ of a vector field (or tensor field)
$Y$ is linear over $C^\infty(M)$ in $X$, it can be used to construct another tensor
field called the total covariant derivative, as follows.

Lemma 4.7. If $\nabla$ is a linear connection on $M$, and $F \in \mathcal{T}^k_l(M)$, the map
$\nabla F\colon \mathcal{T}^1(M)\times\cdots\times\mathcal{T}^1(M)\times\mathcal{T}(M)\times\cdots\times\mathcal{T}(M) \to C^\infty(M)$, given by

    $$\nabla F(\omega^1,\dots,\omega^l,Y_1,\dots,Y_k,X) = \nabla_XF(\omega^1,\dots,\omega^l,Y_1,\dots,Y_k),$$

defines a $\binom{k+1}{l}$-tensor field.

Proof. This follows immediately from the tensor characterization lemma:
$\nabla_XF$ is a tensor field, so it is multilinear over $C^\infty(M)$ in its $k+l$ arguments;
and it is linear over $C^\infty(M)$ in $X$ by definition of a connection. □

The tensor field $\nabla F$ is called the total covariant derivative of $F$. For
example, let $u$ be a smooth function on $M$. Then $\nabla u \in \mathcal{T}^1(M)$ is just the
1-form $du$, because both tensors have the same action on vectors:
$\langle\nabla u,X\rangle = \nabla_Xu = Xu = \langle du,X\rangle$. The 2-tensor $\nabla^2u = \nabla(\nabla u)$ is called the covariant
Hessian of $u$.

Exercise 4.5. Show that for any $u \in C^\infty(M)$ and $X,Y \in \mathcal{T}(M)$,

    $$\nabla^2u(X,Y) = Y(Xu) - (\nabla_YX)u. \qquad (4.8)$$

When we write the components of a total covariant derivative in terms
of coordinates, we use a semicolon to separate indices resulting from differentiation
from the preceding indices. Thus, for example, if $Y$ is a vector
field written in components as $Y = Y^i\partial_i$, the components of the $\binom{1}{1}$-tensor
field $\nabla Y$ are written $Y^i{}_{;j}$, so that

    $$\nabla Y = Y^i{}_{;j}\,\partial_i\otimes dx^j,$$

with

    $$Y^i{}_{;j} = \partial_jY^i + Y^k\Gamma^i_{jk}.$$
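To illustrate the component formula $Y^i{}_{;j} = \partial_jY^i + Y^k\Gamma^i_{jk}$, the sketch below evaluates it for the Euclidean connection on the plane written in polar coordinates. It assumes the standard values $\Gamma^r_{\theta\theta} = -r$ and $\Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = 1/r$ (all others zero), which are not derived in the text at this point. The test field is the constant Cartesian field $\partial/\partial x$, whose polar components $(\cos\theta, -\sin\theta/r)$ are far from constant.

```python
import math

def polar_christoffel(p):
    """Assumed Christoffel symbols of the Euclidean connection in polar
    coordinates (r, theta); G[k][i][j] stands for Gamma^k_{ij}."""
    r, _ = p
    return [[[0.0, 0.0], [0.0, -r]],
            [[0.0, 1.0 / r], [1.0 / r, 0.0]]]

def total_covariant_derivative(Y, christoffel, p, h=1e-6):
    """Matrix D[i][j] = Y^i_{;j} = d_j Y^i + Y^k Gamma^i_{jk} at the point p."""
    n = len(p)
    Yp, G = Y(p), christoffel(p)
    D = [[0.0] * n for _ in range(n)]
    for j in range(n):
        hi, lo = list(p), list(p)
        hi[j] += h
        lo[j] -= h
        dY = [(a - b) / (2.0 * h) for a, b in zip(Y(hi), Y(lo))]
        for i in range(n):
            D[i][j] = dY[i] + sum(Yp[k] * G[i][j][k] for k in range(n))
    return D

# The constant Cartesian field d/dx, written in polar components (Y^r, Y^theta).
ddx = lambda p: [math.cos(p[1]), -math.sin(p[1]) / p[0]]
D = total_covariant_derivative(ddx, polar_christoffel, [2.0, 0.8])
```

Every entry of `D` is zero up to discretization error: the partial derivatives of the components do not vanish, but the Christoffel terms cancel them.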

More generally, the next lemma gives a formula for the components of 
covariant derivatives of arbitrary tensor fields. 

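Lemma 4.8. Let $\nabla$ be a linear connection. The components of the total
covariant derivative of a $\binom{k}{l}$-tensor field $F$ with respect to a coordinate
system are given by

    $$F^{j_1\dots j_l}_{i_1\dots i_k;m} = \partial_mF^{j_1\dots j_l}_{i_1\dots i_k} + \sum_{s=1}^{l} F^{j_1\dots p\dots j_l}_{i_1\dots i_k}\Gamma^{j_s}_{mp} - \sum_{s=1}^{k} F^{j_1\dots j_l}_{i_1\dots p\dots i_k}\Gamma^{p}_{mi_s}.$$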

Exercise 4.6. Prove Lemma 4.8. 


Vector Fields Along Curves 

Without further qualification, a curve in a manifold $M$ always means for
us a smooth, parametrized curve; that is, a smooth map $\gamma\colon I \to M$, where
$I \subset \mathbb{R}$ is some interval. Unless otherwise specified, we won’t worry about
whether the interval is open or closed, bounded or unbounded. A curve
segment is a curve whose domain is a closed, bounded interval $[a,b] \subset \mathbb{R}$.

If $\gamma\colon I \to M$ is a curve and the interval $I$ has an endpoint, smoothness
of $\gamma$ means by definition that $\gamma$ extends to a smooth curve defined on some
open interval containing $I$. It can be shown (though we will not do so)
that this notion of smoothness is equivalent to the component functions $\gamma^i$
in any local coordinates having one-sided derivatives of all orders at the
endpoint, or having derivatives of all orders that extend continuously to
the endpoint. When working with a smooth curve $\gamma$ defined on an interval
that has one or two endpoints, we can always extend $\gamma$ to a smooth curve
on a slightly larger open interval, work with that curve, and restrict back
to the original interval; the values on $I$ of any continuous function of the
derivatives of $\gamma$ are independent of the extension. Thus in proofs we can
assume whenever convenient that $\gamma$ is defined on an open interval.


Let $\gamma\colon I \to M$ be a curve. At any time $t \in I$, the velocity $\dot\gamma(t)$ of $\gamma$ is
invariantly defined as the push-forward $\gamma_*(d/dt)$. It acts on functions by

    $$\dot\gamma(t)f = \frac{d}{dt}(f\circ\gamma)(t).$$

As mentioned above, this corresponds to the usual notion of velocity in
coordinates. If we write the coordinate representation of $\gamma$ as
$\gamma(t) = (\gamma^1(t),\dots,\gamma^n(t))$, then

    $$\dot\gamma(t) = \dot\gamma^i(t)\,\partial_i. \qquad (4.9)$$

(A dot always denotes the ordinary derivative with respect to $t$.)

A vector field along a curve $\gamma\colon I \to M$ is a smooth map $V\colon I \to TM$
such that $V(t) \in T_{\gamma(t)}M$ for every $t \in I$. We let $\mathcal{T}(\gamma)$ denote the space of
vector fields along $\gamma$. The most obvious example of a vector field along a
curve $\gamma$ is its velocity vector: $\dot\gamma(t) \in T_{\gamma(t)}M$ for each $t$, and the coordinate
expression (4.9) shows that it is smooth. Here is another example: If $\gamma$ is
a curve in $\mathbb{R}^2$, let $N(t) = J\dot\gamma(t)$, where $J$ is counterclockwise rotation by
$\pi/2$, so $N(t)$ is normal to $\dot\gamma(t)$. In components, $N(t) = (-\dot\gamma^2(t),\dot\gamma^1(t))$, so
$N$ is a smooth vector field along $\gamma$.

A large class of examples is provided by the following construction: Suppose
$\gamma\colon I \to M$ is a curve, and $\tilde{V} \in \mathcal{T}(M)$ is a vector field on $M$. For each
$t \in I$, let $V(t) = \tilde{V}_{\gamma(t)}$. It is easy to check in coordinates that $V$ is smooth.
A vector field $V$ along $\gamma$ is said to be extendible if there exists a vector
field $\tilde{V}$ on a neighborhood of the image of $\gamma$ that is related to $V$ in this
way (Figure 4.4). Not every vector field along a curve need be extendible;
for example, if $\gamma(t_1) = \gamma(t_2)$ but $\dot\gamma(t_1) \neq \dot\gamma(t_2)$ (Figure 4.5), then $\dot\gamma$ is not
extendible.


Covariant Derivatives Along Curves

Now we can address the question that originally motivated the definition 
of connections: How can we make sense of the directional derivative of a 
vector field along a curve? 

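Lemma 4.9. Let $\nabla$ be a linear connection on $M$. For each curve $\gamma\colon I \to M$,
$\nabla$ determines a unique operator

    $$D_t\colon \mathcal{T}(\gamma) \to \mathcal{T}(\gamma)$$

satisfying the following properties:

(a) Linearity over $\mathbb{R}$:

    $$D_t(aV + bW) = aD_tV + bD_tW \quad\text{for } a,b \in \mathbb{R}.$$

(b) Product rule:

    $$D_t(fV) = \dot{f}V + fD_tV \quad\text{for } f \in C^\infty(I).$$

(c) If $V$ is extendible, then for any extension $\tilde{V}$ of $V$,

    $$D_tV(t) = \nabla_{\dot\gamma(t)}\tilde{V}.$$

For any $V \in \mathcal{T}(\gamma)$, $D_tV$ is called the covariant derivative of $V$ along $\gamma$.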

Proof. First we show uniqueness. Suppose $D_t$ is such an operator, and let
$t_0 \in I$ be arbitrary. An argument similar to that of Lemma 4.1 shows that
the value of $D_tV$ at $t_0$ depends only on the values of $V$ in any interval
$(t_0-\varepsilon,t_0+\varepsilon)$ containing $t_0$. (If $I$ has an endpoint, extend $\gamma$ to a slightly
bigger open interval, prove the lemma there, and then restrict back to $I$.)
Choose coordinates near $\gamma(t_0)$, and write

    $$V(t) = V^j(t)\,\partial_j$$

near $t_0$. Then by the properties of $D_t$, since $\partial_j$ is extendible,

    $$\begin{aligned}
    D_tV(t_0) &= \dot{V}^j(t_0)\,\partial_j + V^j(t_0)\nabla_{\dot\gamma(t_0)}\partial_j \\
              &= \bigl(\dot{V}^k(t_0) + V^j(t_0)\dot\gamma^i(t_0)\Gamma^k_{ij}(\gamma(t_0))\bigr)\,\partial_k. \qquad (4.10)
    \end{aligned}$$

This shows that such an operator is unique if it exists.

For existence, if $\gamma(I)$ is contained in a single chart, we can define $D_tV$ by
(4.10); the easy verification that it satisfies the requisite properties is left
to the reader. In the general case, we can cover $\gamma(I)$ with coordinate charts
and define $D_tV$ by this formula in each chart, and uniqueness implies the
various definitions agree whenever two or more charts overlap. □

Exercise 4.7. Improve Lemma 4.1 by showing that $\nabla_{X_p}Y$ actually depends
only on the values of $Y$ along any curve tangent to $X_p$. More precisely,
suppose that $\gamma\colon (-\varepsilon,\varepsilon) \to M$ is a curve with $\gamma(0) = p$ and $\dot\gamma(0) = X_p$,
and suppose $Y$ and $\tilde{Y}$ are vector fields that agree along $\gamma$. Show that
$\nabla_{X_p}Y = \nabla_{X_p}\tilde{Y}$.
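The coordinate formula for $D_tV$ obtained in the proof of Lemma 4.9 can be evaluated numerically. The sketch below applies it to the velocity of the unit circle in polar coordinates, assuming the standard Christoffel symbols of the Euclidean connection in those coordinates ($\Gamma^r_{\theta\theta} = -r$, $\Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = 1/r$), which the text has not derived. Function names are illustrative.

```python
import math

def covariant_derivative_along(curve, V, christoffel, t, h=1e-5):
    """Components of D_t V at time t:
    Vdot^k + V^j * gammadot^i * Gamma^k_{ij}(gamma(t)),
    with the t-derivatives approximated by central differences."""
    Vt = V(t)
    n = len(Vt)
    Vdot = [(a - b) / (2.0 * h) for a, b in zip(V(t + h), V(t - h))]
    gdot = [(a - b) / (2.0 * h) for a, b in zip(curve(t + h), curve(t - h))]
    G = christoffel(curve(t))
    return [Vdot[k] + sum(Vt[j] * gdot[i] * G[k][i][j]
                          for i in range(n) for j in range(n))
            for k in range(n)]

def polar_christoffel(p):
    """Assumed symbols of the Euclidean connection in polar coordinates;
    G[k][i][j] stands for Gamma^k_{ij}."""
    r, _ = p
    return [[[0.0, 0.0], [0.0, -r]],
            [[0.0, 1.0 / r], [1.0 / r, 0.0]]]

circle = lambda t: [1.0, t]        # (r(t), theta(t)) = (1, t)
velocity = lambda t: [0.0, 1.0]    # coordinate velocity (rdot, thetadot)
acc = covariant_derivative_along(circle, velocity, polar_christoffel, 0.5)
```

Under these assumptions `acc` is approximately `[-1, 0]`, that is, $-\partial_r$: the inward unit vector, in agreement with the Euclidean acceleration $(-\cos t, -\sin t)$ of the same circle rather than with the naive value $(0,0)$.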


Geodesics 

Armed with the notion of covariant differentiation along curves, we can 
now define acceleration and geodesics. 

Let $M$ be a manifold with a linear connection $\nabla$, and let $\gamma$ be a curve
in $M$. The acceleration of $\gamma$ is the vector field $D_t\dot\gamma$ along $\gamma$. A curve $\gamma$ is
called a geodesic with respect to $\nabla$ if its acceleration is zero: $D_t\dot\gamma \equiv 0$.

Exercise 4.8. Show that the geodesics on $\mathbb{R}^n$ with respect to the Euclidean
connection (4.4) are exactly the straight lines with constant speed
parametrizations.


Theorem 4.10. (Existence and Uniqueness of Geodesics) Let $M$ be
a manifold with a linear connection. For any $p \in M$, any $V \in T_pM$, and
any $t_0 \in \mathbb{R}$, there exist an open interval $I \subset \mathbb{R}$ containing $t_0$ and a geodesic
$\gamma\colon I \to M$ satisfying $\gamma(t_0) = p$, $\dot\gamma(t_0) = V$. Any two such geodesics agree
on their common domain.

Proof. Choose coordinates $(x^i)$ on some neighborhood $U$ of $p$. From (4.10),
a curve $\gamma\colon I \to U$ is a geodesic if and only if its component functions
$\gamma(t) = (x^1(t),\dots,x^n(t))$ satisfy the geodesic equation

    $$\ddot{x}^k(t) + \dot{x}^i(t)\dot{x}^j(t)\Gamma^k_{ij}(x(t)) = 0. \qquad (4.11)$$

This is a second-order system of ordinary differential equations for the
functions $x^i(t)$. The usual trick for proving existence and uniqueness for a
second-order system is to introduce auxiliary variables $v^i = \dot{x}^i$ to convert
it to the following equivalent first-order system in twice the number of
variables:

    $$\begin{aligned}
    \dot{x}^k(t) &= v^k(t), \\
    \dot{v}^k(t) &= -v^i(t)v^j(t)\Gamma^k_{ij}(x(t)).
    \end{aligned}$$

By the existence and uniqueness theorem for first-order ODEs (see, for
example, [Boo86, Theorem IV.4.1]), for any $(p,V) \in U\times\mathbb{R}^n$, there exist
$\varepsilon > 0$ and a unique solution $\eta\colon (t_0-\varepsilon,t_0+\varepsilon) \to U\times\mathbb{R}^n$ to this system
satisfying the initial condition $\eta(t_0) = (p,V)$. If we write the component
functions of $\eta$ as $\eta(t) = (x^i(t),v^i(t))$, then we can easily check that the
curve $\gamma(t) = (x^1(t),\dots,x^n(t))$ in $U$ satisfies the existence claim of the
theorem.

To prove the uniqueness claim, suppose $\gamma,\sigma\colon I \to M$ are geodesics defined
on an open interval with $\gamma(t_0) = \sigma(t_0)$ and $\dot\gamma(t_0) = \dot\sigma(t_0)$. By the
uniqueness part of the ODE theorem, they agree on some neighborhood of
$t_0$. Let $\beta$ be the supremum of numbers $b$ such that they agree on $[t_0,b]$.
If $\beta \in I$, then by continuity $\gamma(\beta) = \sigma(\beta)$ and $\dot\gamma(\beta) = \dot\sigma(\beta)$, and applying
local uniqueness in a neighborhood of $\beta$, we conclude that they agree on
a slightly larger interval (Figure 4.6), which is a contradiction. Arguing
similarly to the left of $t_0$, we conclude that they agree on all of $I$. □

FIGURE 4.6. Uniqueness of geodesics.
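The reduction to a first-order system is also how geodesics are computed in practice. The following is a minimal sketch, not the text's method: it integrates the first-order system with a hand-written classical Runge-Kutta step. As test data it assumes the Christoffel symbols of the Euclidean connection in polar coordinates ($\Gamma^r_{\theta\theta} = -r$, $\Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = 1/r$), so the resulting geodesic should be a straight line.

```python
import math

def geodesic_rhs(christoffel, state):
    """Right-hand side of the first-order system; state = [x^1..x^n, v^1..v^n]."""
    n = len(state) // 2
    x, v = state[:n], state[n:]
    G = christoffel(x)
    acc = [-sum(v[i] * v[j] * G[k][i][j] for i in range(n) for j in range(n))
           for k in range(n)]
    return v + acc   # list concatenation: (xdot, vdot)

def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for an autonomous system."""
    k1 = f(state)
    k2 = f([s + 0.5 * dt * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * dt * k for s, k in zip(state, k2)])
    k4 = f([s + dt * k for s, k in zip(state, k3)])
    return [s + dt * (a + 2 * b + 2 * c + d) / 6.0
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]

def integrate_geodesic(christoffel, x0, v0, t_end, steps=1000):
    """Return (position, velocity) at time t_end for initial data (x0, v0) at time 0."""
    state = list(x0) + list(v0)
    dt = t_end / steps
    for _ in range(steps):
        state = rk4_step(lambda s: geodesic_rhs(christoffel, s), state, dt)
    n = len(x0)
    return state[:n], state[n:]

def polar_christoffel(p):
    """Assumed symbols of the Euclidean connection in polar coordinates;
    G[k][i][j] stands for Gamma^k_{ij}."""
    r, _ = p
    return [[[0.0, 0.0], [0.0, -r]],
            [[0.0, 1.0 / r], [1.0 / r, 0.0]]]

# Start at (r, theta) = (1, 0) with velocity d/d(theta), i.e. Cartesian velocity (0, 1).
x1, v1 = integrate_geodesic(polar_christoffel, [1.0, 0.0], [0.0, 1.0], 1.0)
```

With these initial data the exact geodesic is the vertical line $x = 1$, $y = t$, so at $t = 1$ the polar coordinates should be close to $(\sqrt{2}, \pi/4)$. The theorem only guarantees a solution on a short interval; the fixed step count here is a convenience for this example, not a general stopping rule.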

It follows from the uniqueness statement in the preceding theorem that
for any $p \in M$ and $V \in T_pM$, there is a unique maximal geodesic (one
that cannot be extended to any larger interval) $\gamma\colon I \to M$ with $\gamma(0) = p$
and $\dot\gamma(0) = V$, defined on some open interval $I$; just let $I$ be the union
of all open intervals on which such a geodesic is defined, and observe that
the various geodesics agree where they overlap. This maximal geodesic is
often called simply the geodesic with initial point $p$ and initial velocity $V$,
and is denoted $\gamma_V$. (The initial point $p$ does not need to be specified in
the notation, because it can implicitly be recovered from $V$ by $p = \pi(V)$,
where $\pi\colon TM \to M$ is the natural projection.)


Parallel Translation 

One more construction involving covariant differentiation along curves that 
will be useful later is parallel translation. 

Let $M$ be a manifold with a linear connection $\nabla$. A vector field $V$ along
a curve $\gamma$ is said to be parallel along $\gamma$ with respect to $\nabla$ if $D_tV \equiv 0$. Thus
a geodesic can be characterized as a curve whose velocity vector field is
parallel along the curve. A vector field $V$ on $M$ is said to be parallel if it is
parallel along every curve; it is easy to check that $V$ is parallel if and only
if its total covariant derivative $\nabla V$ vanishes identically.

Exercise 4.9. Let $\gamma\colon I \to \mathbb{R}^n$ be any curve. Show that a vector field $V$
along $\gamma$ is parallel with respect to the Euclidean connection if and only if its
components are constants.


The fundamental fact about parallel vector fields is that any tangent 
vector at any point on a curve can be uniquely extended to a parallel 
vector field along the entire curve. 

Theorem 4.11. (Parallel Translation) Given a curve $\gamma\colon I \to M$, $t_0 \in I$,
and a vector $V_0 \in T_{\gamma(t_0)}M$, there exists a unique parallel vector field $V$
along $\gamma$ such that $V(t_0) = V_0$.

The vector field asserted to exist in Theorem 4.11 is called the parallel
translate of $V_0$ along $\gamma$ (Figure 4.7). The proof of the theorem will use
the following basic fact about ordinary differential equations: it says that,
although in general we can only guarantee that solutions to ODEs exist for
a short time, solutions to linear equations always exist for all time.

Theorem 4.12. (Existence and Uniqueness for Linear ODEs) Let
$I \subset \mathbb{R}$ be an interval, and for $1 \le j,k \le n$ let $A^k_j: I \to \mathbb{R}$ be arbitrary
smooth functions. The linear initial-value problem

$$\dot V^k(t) = A^k_j(t)\,V^j(t), \qquad V^k(t_0) = B^k \tag{4.12}$$

has a unique solution on all of $I$ for any $t_0 \in I$ and any initial vector
$(B^1,\dots,B^n) \in \mathbb{R}^n$.
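A one-dimensional comparison may help make the contrast concrete. The two equations below are standard illustrations chosen here as assumptions, not examples taken from the text: the first is nonlinear and its solution escapes to infinity in finite time, while the second is the scalar case of (4.12) and its solution is defined wherever the coefficient $a$ is.

```latex
% Nonlinear: the solution blows up at t = 1.
\[
  \dot v = v^2,\quad v(0) = 1
  \quad\Longrightarrow\quad
  v(t) = \frac{1}{1-t}, \qquad t < 1 .
\]
% Linear (n = 1 in (4.12), with A^1_1 = a): the solution exists on all of I.
\[
  \dot v = a(t)\,v,\quad v(t_0) = B
  \quad\Longrightarrow\quad
  v(t) = B \exp\!\Big(\int_{t_0}^{t} a(s)\,ds\Big), \qquad t \in I .
\]
```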




Exercise 4.10. Prove the following Escape Lemma: Let $Y$ be a vector
field on a manifold $M$, and let $\gamma: (\alpha,\beta) \to M$ be an integral curve of $Y$. If
$\beta < \infty$ and the image of $\gamma$ is contained in some compact subset $K \subset M$,
then $\gamma$ extends to an integral curve on $(\alpha, \beta + \varepsilon)$ for some $\varepsilon > 0$. (See [Boo86,
Lemma IV.5.1].)

Exercise 4.11. Prove Theorem 4.12, as follows. Consider the vector field
$Y$ on $I \times \mathbb{R}^n$ given by

$$Y^0(x^0,\dots,x^n) = 1,$$

$$Y^k(x^0,\dots,x^n) = A^k_j(x^0)\,x^j, \qquad k = 1,\dots,n.$$

(a) Show that any solution to (4.12) is the projection to $\mathbb{R}^n$ of an integral
curve of $Y$.

(b) For any compact subinterval $K \subset I$, show there exists a positive constant
$C$ such that every solution $V(t) = (V^1(t),\dots,V^n(t))$ to (4.12)
on $K$ satisfies

$$\frac{d}{dt}\left(e^{-Ct}\,|V(t)|^2\right) \le 0.$$

(Here $|V(t)|$ is just the Euclidean norm.)

(c) If an integral curve of $Y$ is defined only on some proper subinterval of
$I$, use Exercise 4.10 above to derive a contradiction.

Proof of Theorem 4.11. First suppose $\gamma(I)$ is contained in a single coordinate
chart. Then, using formula (4.10), $V$ is parallel along $\gamma$ if and only
if

$$\dot V^k(t) = -V^j(t)\,\dot\gamma^i(t)\,\Gamma^k_{ij}(\gamma(t)), \qquad k = 1,\dots,n. \tag{4.13}$$

This is a linear system of ODEs for $(V^1(t),\dots,V^n(t))$. Thus Theorem 4.12
guarantees the existence and uniqueness of a solution on all of $I$ with any
initial condition $V(t_0) = V_0$.

Now suppose $\gamma(I)$ is not covered by a single chart. Let $\beta$ denote the
supremum of all $b > t_0$ for which there is a unique parallel translate on
$[t_0,b]$. Clearly $\beta > t_0$, since for $b$ close enough to $t_0$, $\gamma[t_0,b]$ is contained
in a single chart and the above argument applies. Then a unique parallel
translate $V$ exists on $[t_0,\beta)$ (Figure 4.8). If $\beta \in I$, choose coordinates on an
open set containing $\gamma(\beta - \delta, \beta + \delta)$ for some positive $\delta$. (As usual, we assume
$\gamma$ has been extended to an open interval if necessary.) Then there exists a
unique parallel vector field $\tilde V$ on $(\beta - \delta, \beta + \delta)$ satisfying the initial condition
$\tilde V(\beta - \delta/2) = V(\beta - \delta/2)$. By uniqueness, $V = \tilde V$ on their common domain,
and therefore $\tilde V$ is an extension of $V$ past $\beta$, which is a contradiction. □

FIGURE 4.8. Existence and uniqueness of parallel translates.

We conclude this chapter with an important remark. If $\gamma: I \to M$ is a
curve and $t_0, t_1 \in I$, parallel translation defines an operator

$$P_{t_0 t_1}: T_{\gamma(t_0)}M \to T_{\gamma(t_1)}M \tag{4.14}$$

by setting $P_{t_0 t_1}V_0 = V(t_1)$, where $V$ is the parallel translate of $V_0$ along $\gamma$.
It is easy to check that this is a linear isomorphism between $T_{\gamma(t_0)}M$ and
$T_{\gamma(t_1)}M$ (because the equation of parallelism is linear). The next exercise
shows that covariant differentiation along $\gamma$ can be recovered from this
operator. This is the sense in which a connection “connects” nearby tangent
spaces.


Exercise 4.12. Let $\nabla$ be a linear connection on $M$. Show that covariant
differentiation along a curve $\gamma$ can be recovered from parallel translation, by
the following formula:

$$D_t V(t_0) = \lim_{t \to t_0} \frac{P_{t_0 t}^{-1} V(t) - V(t_0)}{t - t_0}.$$

[Hint: Use a parallel frame along $\gamma$.]


Problems

4-1. Let $\nabla$ be a connection on $M$. Suppose we are given two local frames
$\{E_i\}$ and $\{\tilde E_j\}$ on an open subset $U \subset M$, related by $\tilde E_i = A_i^j E_j$ for
some matrix of functions $(A_i^j)$. Let $\Gamma^k_{ij}$ and $\tilde\Gamma^k_{ij}$ denote the Christoffel
symbols of $\nabla$ with respect to these two frames. Compute a transformation
law expressing $\tilde\Gamma^k_{ij}$ in terms of $\Gamma^k_{ij}$ and $A_i^j$.

4-2. Let $\nabla$ be a linear connection on $M$, and define a map $\tau: \mathcal{T}(M) \times
\mathcal{T}(M) \to \mathcal{T}(M)$ by

$$\tau(X, Y) = \nabla_X Y - \nabla_Y X - [X, Y].$$

(a) Show that $\tau$ is a $\binom{2}{1}$-tensor field, called the torsion tensor of $\nabla$.

(b) We say $\nabla$ is symmetric if its torsion vanishes identically. Show
that $\nabla$ is symmetric if and only if its Christoffel symbols with respect
to any coordinate frame are symmetric: $\Gamma^k_{ij} = \Gamma^k_{ji}$. [Warning:
They might not be symmetric with respect to other frames.]

(c) Show that $\nabla$ is symmetric if and only if the covariant Hessian
$\nabla^2 u$ of any smooth function $u \in C^\infty(M)$ is a symmetric 2-tensor
field.

(d) Show that the Euclidean connection $\bar\nabla$ on $\mathbb{R}^n$ is symmetric.

4-3. In your study of differentiable manifolds, you have already seen another
way of taking “directional derivatives of vector fields,” the Lie
derivative $\mathcal{L}_X Y$.

(a) Show that the map $\mathcal{L}: \mathcal{T}(M) \times \mathcal{T}(M) \to \mathcal{T}(M)$ is not a connection.

(b) Show that there is a vector field on that vanishes along the 
x^-axis, but whose Lie derivative with respect to di does not 
vanish on the x^-axis. [This shows that Lie differentiation does 
not give a well-defined way to take directional derivatives of 
vector fields along curves.] 

4-4. (a) If $\nabla^0$ and $\nabla^1$ are any two linear connections on $M$, show that
the difference between them defines a $\binom{2}{1}$-tensor field $A$ by

$$A(X, Y) = \nabla^1_X Y - \nabla^0_X Y,$$

called the difference tensor. Thus, if $\nabla^0$ is any linear connection
on $M$, the set of all linear connections is precisely $\{\nabla^0 + A : A \in
\mathcal{T}^2_1(M)\}$.

(b) Show that $\nabla^0$ and $\nabla^1$ determine the same geodesics if and
only if their difference tensor is antisymmetric, i.e., $A(X,Y) =
-A(Y,X)$.

(c) Show that $\nabla^0$ and $\nabla^1$ have the same torsion tensor (Problem 4-2)
if and only if their difference tensor is symmetric, i.e.,
$A(X,Y) = A(Y,X)$.

4-5. Let $\nabla$ be a linear connection on $M$, let $\{E_i\}$ be a local frame on some
open subset $U \subset M$, and let $\{\varphi^i\}$ be the dual coframe.

(a) Show that there is a uniquely determined matrix of 1-forms $\omega_i^j$
on $U$, called the connection 1-forms for this frame, such that

$$\nabla_X E_i = \omega_i^j(X)\,E_j$$

for all $X \in TM$.

(b) Prove Cartan’s first structure equation:

$$d\varphi^j = \varphi^i \wedge \omega_i^j + \tau^j,$$

where $\{\tau^1,\dots,\tau^n\}$ are the torsion 2-forms, defined in terms of
the torsion tensor $\tau$ (Problem 4-2) and the frame $\{E_i\}$ by

$$\tau(X,Y) = \tau^j(X,Y)\,E_j.$$





5 

Riemannian Geodesics 


If we are to use geodesics and covariant derivatives as tools for studying 
Riemannian geometry, it is evident that we need a way to single out a 
particular connection on a Riemannian manifold that reflects the properties 
of the metric. In this chapter, guided by the example of an embedded 
submanifold of $\mathbb{R}^n$, we describe two properties that determine a unique
connection on any Riemannian manifold. The first property, compatibility 
with the metric, is easy to motivate and understand. The second, symmetry, 
is a bit more mysterious. 

After defining the Riemannian connection and its geodesics, we investigate
the exponential map, which conveniently encodes the collective behavior
of geodesics and allows us to study the way they change as the initial
point and initial vector vary. Having established the properties of this map, 
we introduce normal neighborhoods and Riemannian normal coordinates. 
Finally, we return to our model Riemannian manifolds and determine their 
geodesics. 


The Riemannian Connection 


We are going to show that on each Riemannian manifold there is a natural
connection that is particularly suited to computations in Riemannian
geometry. Since we get most of our intuition about Riemannian manifolds
from studying submanifolds of $\mathbb{R}^n$ with the induced metric, let’s start by
examining that case. As a guiding principle, consider the idea mentioned
in the beginning of Chapter 4: A geodesic in a submanifold of $\mathbb{R}^n$ should
be “as straight as possible,” which we take to mean that its acceleration
vector field should have zero tangential projection onto $TM$.

To express this in the language of connections, let $M \subset \mathbb{R}^n$ be an embedded
submanifold. Any vector field on $M$ can be extended to a smooth
vector field on $\mathbb{R}^n$ by the result of Exercise 2.3(b). Define a map

$$\nabla^\top: \mathcal{T}(M) \times \mathcal{T}(M) \to \mathcal{T}(M)$$

by setting

$$\nabla^\top_X Y := \pi^\top(\bar\nabla_X Y),$$

where $X$ and $Y$ are extended arbitrarily to $\mathbb{R}^n$, $\bar\nabla$ is the Euclidean connection
(4.4) on $\mathbb{R}^n$, and for any point $p \in M$, $\pi^\top: T_p\mathbb{R}^n \to T_pM$ is the
orthogonal projection. As the next lemma shows, this turns out to be a
linear connection on $M$, called the tangential connection.

Lemma 5.1. The operator $\nabla^\top$ is well defined, and is a connection on $M$.

Proof. Since the value of $\bar\nabla_X Y$ at a point $p \in M$ depends only on $X_p$,
$\nabla^\top_X Y$ is clearly independent of the choice of vector field extending $X$. On
the other hand, because of the result of Exercise 4.7, the value of $\bar\nabla_X Y$ at $p$
depends only on the values of $Y$ along a curve whose initial tangent vector
is $X_p$; taking the curve to lie entirely in $M$ shows that $\nabla^\top_X Y$ depends only
on the original vector field $Y \in \mathcal{T}(M)$. Thus $\nabla^\top$ is well defined. Smoothness
follows easily by expressing $\bar\nabla_X Y$ in terms of an adapted orthonormal frame
as in Problem 3-1.

It is obvious from the definition that $\nabla^\top_X Y$ is linear over $C^\infty(M)$ in $X$
and over $\mathbb{R}$ in $Y$, so to show that it is a connection, only the product rule
needs checking. Let $f \in C^\infty(M)$ be extended arbitrarily to $\mathbb{R}^n$. Evaluating
along $M$, we get

$$\begin{aligned}
\nabla^\top_X(fY) &= \pi^\top\big(\bar\nabla_X(fY)\big) \\
&= (Xf)\,\pi^\top Y + f\,\pi^\top(\bar\nabla_X Y) \\
&= (Xf)\,Y + f\,\nabla^\top_X Y.
\end{aligned}$$

Thus $\nabla^\top$ is a connection. □
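As a small numerical illustration of the tangential projection, the sketch below works on the unit sphere in $\mathbb{R}^3$, where the orthogonal projection onto the tangent plane at a point subtracts the component along the position vector. The function names (`tangential_part`, `great_circle`, `latitude_circle`) and the choice of curves are assumptions made for this example only. It compares the tangential part of the Euclidean acceleration for a great circle and for a smaller latitude circle.

```python
import math

def tangential_part(point, vector):
    """Project `vector` orthogonally onto the tangent plane of the unit sphere at `point`."""
    dot = sum(p * v for p, v in zip(point, vector))
    return tuple(v - dot * p for p, v in zip(point, vector))

def great_circle(t):
    """Position and Euclidean acceleration of the equator, traversed at unit speed."""
    pos = (math.cos(t), math.sin(t), 0.0)
    acc = (-math.cos(t), -math.sin(t), 0.0)
    return pos, acc

def latitude_circle(t, colat=math.pi / 3):
    """Position and Euclidean acceleration of the circle of colatitude `colat`."""
    s, c = math.sin(colat), math.cos(colat)
    pos = (s * math.cos(t), s * math.sin(t), c)
    acc = (-s * math.cos(t), -s * math.sin(t), 0.0)
    return pos, acc

def norm(v):
    return math.sqrt(sum(x * x for x in v))

# Length of the tangential acceleration at an arbitrary parameter value.
eq_tan = norm(tangential_part(*great_circle(0.7)))
lat_tan = norm(tangential_part(*latitude_circle(0.7)))
```

Here `eq_tan` vanishes up to rounding, consistent with the guiding principle that a curve is "as straight as possible" when its acceleration has zero tangential projection, while `lat_tan` equals $\sin(\pi/3)\cos(\pi/3)$, so the latitude circle does not satisfy that condition.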

There is a celebrated (and hard) theorem of John Nash [Nas56] that
says any Riemannian metric on any manifold can be realized as the induced
metric of some embedding in a Euclidean space. Thus, in a certain
sense, one would lose no generality by studying only submanifolds of $\mathbb{R}^n$
with their induced metrics, for which the tangential connection would suffice.
However, when one is trying to understand intrinsic properties of a
Riemannian manifold, an embedding introduces a great deal of extraneous
information, and in some cases actually makes it harder to discern which
geometric properties depend only on the metric. Our task in this chapter is
to distinguish some important properties of the tangential connection that
make sense for connections on an abstract Riemannian manifold, and to
use them to single out a unique connection in the abstract case.

The Euclidean connection on $\mathbb{R}^n$ has one very nice property with respect
to the Euclidean metric: it satisfies the product rule

$$\bar\nabla_X \langle Y, Z\rangle = \langle \bar\nabla_X Y, Z\rangle + \langle Y, \bar\nabla_X Z\rangle,$$

as you can verify easily by computing in terms of the standard basis. It is
almost immediate that the tangential connection has the same property, if
we now interpret all the vector fields as being tangent to $M$ and interpret
the inner products as being taken with respect to the induced metric on
$M$ (see Exercise 5.2 below).

This property makes sense on an abstract Riemannian manifold, and
seems so natural and desirable that it has a name. Let $g$ be a Riemannian
(or pseudo-Riemannian) metric on a manifold $M$. A linear connection $\nabla$ is
said to be compatible with $g$ if it satisfies the following product rule for all
vector fields $X, Y, Z$:

$$\nabla_X \langle Y, Z\rangle = \langle \nabla_X Y, Z\rangle + \langle Y, \nabla_X Z\rangle.$$

Lemma 5.2. The following conditions are equivalent for a linear connection
$\nabla$ on a Riemannian manifold:

(a) $\nabla$ is compatible with $g$.

(b) $\nabla g \equiv 0$.

(c) If $V, W$ are vector fields along any curve $\gamma$,

$$\frac{d}{dt}\langle V, W\rangle = \langle D_t V, W\rangle + \langle V, D_t W\rangle.$$

(d) If $V, W$ are parallel vector fields along a curve $\gamma$, then $\langle V, W\rangle$ is constant.

(e) Parallel translation $P_{t_0 t_1}: T_{\gamma(t_0)}M \to T_{\gamma(t_1)}M$ is an isometry for each
$t_0, t_1$ (Figure 5.1).

Exercise 5.1. Prove Lemma 5.2. 

Exercise 5.2. Prove that the tangential connection on any embedded submanifold
of $\mathbb{R}^n$ is compatible with the induced Riemannian metric.
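Conditions (d) and (e) of Lemma 5.2 can be observed numerically. The following sketch integrates the parallel transport equation (4.13) around a latitude circle on the unit sphere with a classical Runge-Kutta scheme. It rests on assumptions that are not derived at this point in the text: coordinates $(\theta, \varphi)$ (colatitude, longitude), the round metric $d\theta^2 + \sin^2\theta\, d\varphi^2$, and its standard nonzero Christoffel symbols $\Gamma^\theta_{\varphi\varphi} = -\sin\theta\cos\theta$ and $\Gamma^\varphi_{\theta\varphi} = \Gamma^\varphi_{\varphi\theta} = \cot\theta$. All function names are hypothetical.

```python
import math

def rk4_step(f, v, h):
    """One classical Runge-Kutta step for the autonomous system v' = f(v)."""
    k1 = f(v)
    k2 = f(tuple(a + 0.5 * h * b for a, b in zip(v, k1)))
    k3 = f(tuple(a + 0.5 * h * b for a, b in zip(v, k2)))
    k4 = f(tuple(a + h * b for a, b in zip(v, k3)))
    return tuple(a + h / 6.0 * (b1 + 2.0 * b2 + 2.0 * b3 + b4)
                 for a, b1, b2, b3, b4 in zip(v, k1, k2, k3, k4))

def transport_along_latitude(theta0, v0, t_end, steps=2000):
    """Parallel translate v0 = (V^theta, V^phi) along gamma(t) = (theta0, t)."""
    s, c = math.sin(theta0), math.cos(theta0)

    def rhs(v):
        v_theta, v_phi = v
        # (4.13) with gamma-dot = (0, 1):
        #   dV^theta/dt = -V^phi * Gamma^theta_{phi phi} = sin*cos * V^phi
        #   dV^phi/dt   = -V^theta * Gamma^phi_{phi theta} = -cot * V^theta
        return (s * c * v_phi, -(c / s) * v_theta)

    h = t_end / steps
    v = tuple(v0)
    for _ in range(steps):
        v = rk4_step(rhs, v, h)
    return v

def g_norm(theta, v):
    """Length of v with respect to the round metric at colatitude theta."""
    return math.sqrt(v[0] ** 2 + (math.sin(theta) * v[1]) ** 2)

theta0 = math.pi / 3
v_start = (1.0, 0.0)
v_end = transport_along_latitude(theta0, v_start, 2.0 * math.pi)
```

The length `g_norm(theta0, v_end)` stays equal to 1 up to integration error, as the isometry property predicts. For this particular latitude the exact solution of the linear system rotates the vector by the angle $2\pi\cos\theta_0 = \pi$, so `v_end` is close to `(-1.0, 0.0)` even though the curve has returned to its starting point.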

FIGURE 5.1. Parallel translation is an isometry.

It turns out that requiring a connection to be compatible with the metric
is not enough to determine a unique connection, so we turn to another key
property of the tangential connection. This involves the torsion tensor of
the connection (see Problem 4-2), which is the $\binom{2}{1}$-tensor field $\tau: \mathcal{T}(M) \times
\mathcal{T}(M) \to \mathcal{T}(M)$ defined by

$$\tau(X, Y) = \nabla_X Y - \nabla_Y X - [X, Y].$$

A linear connection $\nabla$ is said to be symmetric if its torsion vanishes identically,
that is, if

$$\nabla_X Y - \nabla_Y X \equiv [X, Y].$$

Lemma 5.3. The tangential connection on an embedded submanifold $M \subset
\mathbb{R}^n$ is symmetric.

Exercise 5.3. Prove Lemma 5.3. [Hint: If $X$ and $Y$ are vector fields on
$\mathbb{R}^n$ that are tangent to $M$ at points of $M$, so is $[X, Y]$ by Exercise 2.3.]


Theorem 5.4. (Fundamental Lemma of Riemannian Geometry)
Let $(M,g)$ be a Riemannian (or pseudo-Riemannian) manifold. There exists
a unique linear connection $\nabla$ on $M$ that is compatible with $g$ and symmetric.

This connection is called the Riemannian connection or the Levi-Civita 
connection of g. 

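Proof. We prove uniqueness first, by deriving a formula for $\nabla$. Suppose,
therefore, that $\nabla$ is such a connection, and let $X, Y, Z \in \mathcal{T}(M)$ be arbitrary
vector fields. Writing the compatibility equation three times with $X, Y, Z$
cyclically permuted, we obtain

$$\begin{aligned}
X\langle Y, Z\rangle &= \langle \nabla_X Y, Z\rangle + \langle Y, \nabla_X Z\rangle \\
Y\langle Z, X\rangle &= \langle \nabla_Y Z, X\rangle + \langle Z, \nabla_Y X\rangle \\
Z\langle X, Y\rangle &= \langle \nabla_Z X, Y\rangle + \langle X, \nabla_Z Y\rangle.
\end{aligned}$$

Using the symmetry condition on the last term in each line, this can be
rewritten as

$$\begin{aligned}
X\langle Y, Z\rangle &= \langle \nabla_X Y, Z\rangle + \langle Y, \nabla_Z X\rangle + \langle Y, [X, Z]\rangle \\
Y\langle Z, X\rangle &= \langle \nabla_Y Z, X\rangle + \langle Z, \nabla_X Y\rangle + \langle Z, [Y, X]\rangle \\
Z\langle X, Y\rangle &= \langle \nabla_Z X, Y\rangle + \langle X, \nabla_Y Z\rangle + \langle X, [Z, Y]\rangle.
\end{aligned}$$

Adding the first two of these equations and subtracting the third, we obtain

$$X\langle Y, Z\rangle + Y\langle Z, X\rangle - Z\langle X, Y\rangle =
2\langle \nabla_X Y, Z\rangle + \langle Y, [X, Z]\rangle + \langle Z, [Y, X]\rangle - \langle X, [Z, Y]\rangle.$$

Finally, solving for $\langle \nabla_X Y, Z\rangle$, we get

$$\langle \nabla_X Y, Z\rangle = \tfrac{1}{2}\Big(X\langle Y, Z\rangle + Y\langle Z, X\rangle - Z\langle X, Y\rangle
- \langle Y, [X, Z]\rangle - \langle Z, [Y, X]\rangle + \langle X, [Z, Y]\rangle\Big). \tag{5.1}$$

Now suppose $\nabla^1$ and $\nabla^2$ are two connections that are symmetric and
compatible with $g$. Since the right-hand side of (5.1) does not depend on
the connection, it follows that $\langle \nabla^1_X Y - \nabla^2_X Y, Z\rangle = 0$ for all $X, Y, Z$. This
can only happen if $\nabla^1_X Y = \nabla^2_X Y$ for all $X$ and $Y$, so $\nabla^1 = \nabla^2$.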

To prove existence, we use (5.1), or rather a coordinate version of it. It 
suffices to prove that such a connection exists in each coordinate chart, 
for then uniqueness ensures that the connections constructed in different 
charts agree where they overlap. 

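Let $(U, (x^i))$ be any local coordinate chart. Applying (5.1) to the coordinate
vector fields, whose Lie brackets are zero, we obtain

$$\langle \nabla_{\partial_i}\partial_j, \partial_l\rangle = \tfrac{1}{2}\big(\partial_i\langle \partial_j, \partial_l\rangle + \partial_j\langle \partial_l, \partial_i\rangle - \partial_l\langle \partial_i, \partial_j\rangle\big). \tag{5.2}$$

Recall the definitions of the metric coefficients and the Christoffel symbols:

$$g_{ij} = \langle \partial_i, \partial_j\rangle, \qquad \nabla_{\partial_i}\partial_j = \Gamma^m_{ij}\,\partial_m.$$

Inserting these into (5.2) yields

$$\Gamma^m_{ij}\,g_{ml} = \tfrac{1}{2}\big(\partial_i g_{jl} + \partial_j g_{il} - \partial_l g_{ij}\big). \tag{5.3}$$

Finally, multiplying both sides by the inverse matrix $g^{lk}$ and noting that
$g_{ml}\,g^{lk} = \delta^k_m$, we get

$$\Gamma^k_{ij} = \tfrac{1}{2}\,g^{kl}\big(\partial_i g_{jl} + \partial_j g_{il} - \partial_l g_{ij}\big). \tag{5.4}$$

This formula certainly defines a connection in each chart, and it is evident
from the formula that $\Gamma^k_{ij} = \Gamma^k_{ji}$, so the connection is symmetric by Problem
4-2(b). Thus only compatibility with the metric needs to be checked. By
Lemma 5.2, it suffices to show that $\nabla g = 0$. In terms of a coordinate frame,
the components of $\nabla g$ (see Lemma 4.8) are

$$g_{ij;k} = \partial_k g_{ij} - \Gamma^l_{ki}\,g_{lj} - \Gamma^l_{kj}\,g_{il}.$$

Using (5.3) twice, we conclude

$$\begin{aligned}
\Gamma^l_{ki}\,g_{lj} + \Gamma^l_{kj}\,g_{il}
&= \tfrac{1}{2}\big(\partial_k g_{ij} + \partial_i g_{kj} - \partial_j g_{ki}\big) + \tfrac{1}{2}\big(\partial_k g_{ji} + \partial_j g_{ki} - \partial_i g_{kj}\big) \\
&= \partial_k g_{ij},
\end{aligned}$$

which shows that $g_{ij;k} = 0$. □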

A bonus of this proof is that it gives us an explicit formula (5.4) for 
computing the Christoffel symbols of the Riemannian connection in any 
coordinate chart. 
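Formula (5.4) translates directly into a computation. The sketch below is one possible numerical rendering, not a method from the text: it approximates the partial derivatives of the metric by central differences and evaluates (5.4) for a two-dimensional metric. The example metric (the round metric $d\theta^2 + \sin^2\theta\,d\varphi^2$ on the unit sphere), the step size `h`, and all function names are assumptions chosen for illustration.

```python
import math

def sphere_metric(x):
    """Matrix (g_ij) of the round metric in coordinates x = (theta, phi)."""
    theta, _ = x
    return [[1.0, 0.0], [0.0, math.sin(theta) ** 2]]

def inverse_2x2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

def metric_derivative(metric, x, i, h=1e-5):
    """Central-difference approximation of the matrix (d_i g_jl) at x."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    gp, gm = metric(xp), metric(xm)
    return [[(gp[j][l] - gm[j][l]) / (2.0 * h) for l in range(2)]
            for j in range(2)]

def christoffel(metric, x):
    """Return gamma with gamma[k][i][j] approximating Gamma^k_{ij} via (5.4)."""
    n = 2
    g_inv = inverse_2x2(metric(x))
    dg = [metric_derivative(metric, x, i) for i in range(n)]  # dg[i][j][l] = d_i g_jl
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                gamma[k][i][j] = 0.5 * sum(
                    g_inv[k][l] * (dg[i][j][l] + dg[j][i][l] - dg[l][i][j])
                    for l in range(n))
    return gamma

theta = 1.0
G = christoffel(sphere_metric, (theta, 0.3))
```

With index 0 for $\theta$ and 1 for $\varphi$, the entries `G[0][1][1]` and `G[1][0][1]` approximate $-\sin\theta\cos\theta$ and $\cot\theta$ respectively, and `G[1][0][1]` agrees with `G[1][1][0]`, reflecting the symmetry $\Gamma^k_{ij} = \Gamma^k_{ji}$ noted in the proof.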

On any Riemannian manifold, we will always use the Riemannian connection
from now on without further comment. Geodesics with respect to
this connection are called Riemannian geodesics, or simply geodesics, as
long as there is no risk of confusion.

One immediate consequence of the definitions is the following lemma. If
$\gamma$ is a curve in a Riemannian manifold, the speed of $\gamma$ at any time $t$ is the
length of its velocity vector $|\dot\gamma(t)|$. We say $\gamma$ is constant speed if $|\dot\gamma(t)|$ is
independent of $t$, and unit speed if the speed is identically equal to 1.

Lemma 5.5. All Riemannian geodesics are constant speed curves. 

Proof. Let $\gamma$ be a Riemannian geodesic. Since $\dot\gamma$ is parallel along $\gamma$, its
length $|\dot\gamma| = \langle \dot\gamma, \dot\gamma\rangle^{1/2}$ is constant by Lemma 5.2(d). □

Another consequence of the definition is that, because they are defined 
in coordinate-invariant terms, Riemannian connections behave well with 
respect to isometries. 

Proposition 5.6. (Naturality of the Riemannian Connection) Suppose
$\varphi: (M,g) \to (\tilde M, \tilde g)$ is an isometry.

(a) $\varphi$ takes the Riemannian connection $\nabla$ of $g$ to the Riemannian connection
$\tilde\nabla$ of $\tilde g$, in the sense that

$$\varphi_*(\nabla_X Y) = \tilde\nabla_{\varphi_* X}(\varphi_* Y).$$

(b) If $\gamma$ is a curve in $M$ and $V$ is a vector field along $\gamma$, then

$$\varphi_* D_t V = \tilde D_t(\varphi_* V).$$

(c) $\varphi$ takes geodesics to geodesics: if $\gamma$ is the geodesic in $M$ with initial
point $p$ and initial velocity $V$, then $\varphi \circ \gamma$ is the geodesic in $\tilde M$ with
initial point $\varphi(p)$ and initial velocity $\varphi_* V$.


Exercise 5.4. Prove Proposition 5.6 as follows. For part (a), define a map

$$\varphi^*\tilde\nabla: \mathcal{T}(M) \times \mathcal{T}(M) \to \mathcal{T}(M)$$

by

$$(\varphi^*\tilde\nabla)_X Y = \varphi_*^{-1}\big(\tilde\nabla_{\varphi_* X}(\varphi_* Y)\big).$$

Show that $\varphi^*\tilde\nabla$ is a connection on $M$ (called the pullback connection), and
that it is symmetric and compatible with $g$; therefore $\varphi^*\tilde\nabla = \nabla$ by uniqueness
of the Riemannian connection. You will have to unwind the definition
of the push-forward of a vector field very carefully. For part (b), define an
operator $\varphi^*\tilde D_t: \mathcal{T}(\gamma) \to \mathcal{T}(\gamma)$ by a similar formula and show that it is equal
to $D_t$.


At this point, it is probably not clear why symmetry is a condition that
one would want a connection on a Riemannian manifold to satisfy. One
important feature that recommends it is the fact that the geodesics of the
Riemannian connection are locally minimizing (see Theorem 6.12 in the
next chapter). Indeed, the symmetry of the connection plays a decisive role
in the proof. However, this consideration alone does not force the connection
to be symmetric, as you will show in Problem 6-1. A deeper reason
for singling out the symmetry condition is the fact that it is natural, in
the sense of Proposition 5.6. Moreover, since the tangential connection on
an embedded submanifold of $\mathbb{R}^n$ is symmetric and compatible with the
metric, the Riemannian connection must coincide with the tangential connection
in that case. The real reason why this connection has been anointed
as “the” Riemannian connection is this: symmetry and compatibility are
invariantly-defined and natural properties that force the connection to coincide
with the tangential connection whenever $M$ is realized as a submanifold
of $\mathbb{R}^n$ with the induced metric (which the Nash embedding theorem
[Nas56] guarantees is always possible).

The Exponential Map

To further our understanding of Riemannian geodesics, we need to study
their collective behavior, and in particular, to address the following question:
How do geodesics change if we vary the initial point and/or the initial
vector? The dependence of geodesics on the initial data is encoded in
a map from the tangent bundle into the manifold, called the exponential
map, whose properties are fundamental to the further study of Riemannian
geometry.

In Chapter 4 we saw that any initial point $p \in M$ and any initial velocity
vector $V \in T_pM$ determine a unique maximal geodesic $\gamma_V$ (see the remark
after Theorem 4.10). This implicitly defines a map from the tangent bundle
to the set of geodesics in $M$. More importantly, it allows us to define a map
from (a subset of) the tangent bundle to $M$ itself, by sending the vector $V$
to the point obtained by following $\gamma_V$ for time 1.

We note in passing that the results of this section apply with only minor 
changes to pseudo-Riemannian metrics, or indeed to any linear connection. 


Definition and Basic Properties 

To be precise, define a subset $\mathcal{E}$ of $TM$, the domain of the exponential map,
by

$$\mathcal{E} := \{V \in TM : \gamma_V \text{ is defined on an interval containing } [0,1]\},$$

and then define the exponential map $\exp: \mathcal{E} \to M$ by

$$\exp(V) = \gamma_V(1).$$

For each $p \in M$, the restricted exponential map $\exp_p$ is the restriction of
$\exp$ to the set $\mathcal{E}_p := \mathcal{E} \cap T_pM$. (Some authors use the notation “Exp” to
distinguish the Riemannian exponential map from the exponential map of
a Lie group, but we follow the more common convention of writing both
maps with a lowercase “e,” since there will be very little opportunity to
confuse the two.)
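The simplest instance of this definition is Euclidean space, sketched here under the assumption that $\mathbb{R}^n$ carries the Euclidean connection, whose Christoffel symbols vanish in standard coordinates so that geodesics are straight lines:

```latex
% Geodesic equation in standard coordinates of R^n: all Gamma^k_{ij} = 0.
\[
  \ddot\gamma^k(t) = 0
  \quad\Longrightarrow\quad
  \gamma_V(t) = p + tV \quad\text{for all } t \in \mathbb{R},
\]
% Every geodesic is defined on [0,1], so the domain is the whole tangent bundle.
\[
  \mathcal{E} = T\mathbb{R}^n, \qquad \exp_p(V) = \gamma_V(1) = p + V .
\]
```

In this case $\exp_p$ is just translation by $p$ under the usual identification of $T_p\mathbb{R}^n$ with $\mathbb{R}^n$.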

Proposition 5.7. (Properties of the Exponential Map)

(a) $\mathcal{E}$ is an open subset of $TM$ containing the zero section, and each set
$\mathcal{E}_p$ is star-shaped with respect to 0.

(b) For each $V \in TM$, the geodesic $\gamma_V$ is given by

$$\gamma_V(t) = \exp(tV)$$

for all $t$ such that either side is defined.

(c) The exponential map is smooth.

(Recall that a subset $S$ of a vector space is star-shaped with respect to
$x \in S$ if whenever $y \in S$, so is the line segment from $x$ to $y$.)

Before proving the proposition, it is useful to prove the following simple 
rescaling property of geodesics. 

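Lemma 5.8. (Rescaling Lemma) For any $V \in TM$ and $c, t \in \mathbb{R}$,

$$\gamma_{cV}(t) = \gamma_V(ct), \tag{5.5}$$

whenever either side is defined.

Proof. It suffices to show that $\gamma_{cV}(t)$ exists and (5.5) holds whenever the
right-hand side is defined, for then the converse statement follows by replacing
$V$ by $cV$, $t$ by $ct$, and $c$ by $1/c$.

Suppose the domain of $\gamma_V$ is the open interval $I \subset \mathbb{R}$. For simplicity,
write $\gamma = \gamma_V$, and define a new curve $\tilde\gamma$ by $\tilde\gamma(t) = \gamma(ct)$, defined on $c^{-1}I :=
\{t : ct \in I\}$. We will show that $\tilde\gamma$ is a geodesic with initial point $p$ and
initial velocity $cV$; it then follows by uniqueness that it must be equal to
$\gamma_{cV}$.

It is immediate from the definition that $\tilde\gamma(0) = \gamma(0) = p$. Writing $\gamma(t) =
(\gamma^1(t),\dots,\gamma^n(t))$ in any local coordinates, the chain rule gives

$$\dot{\tilde\gamma}^i(t) = \frac{d}{dt}\gamma^i(ct) = c\,\dot\gamma^i(ct).$$

In particular, it follows that $\dot{\tilde\gamma}(0) = c\,\dot\gamma(0) = cV$.

Now let $D_t$ and $\tilde D_t$ denote the covariant differentiation operators along
$\gamma$ and $\tilde\gamma$, respectively. Using the chain rule again in coordinates,

$$\begin{aligned}
\tilde D_t\dot{\tilde\gamma}(t) &= \Big(\frac{d}{dt}\dot{\tilde\gamma}^k(t) + \Gamma^k_{ij}(\tilde\gamma(t))\,\dot{\tilde\gamma}^i(t)\,\dot{\tilde\gamma}^j(t)\Big)\partial_k \\
&= \Big(c^2\ddot\gamma^k(ct) + c^2\,\Gamma^k_{ij}(\gamma(ct))\,\dot\gamma^i(ct)\,\dot\gamma^j(ct)\Big)\partial_k \\
&= c^2 D_t\dot\gamma(ct) = 0.
\end{aligned}$$

Thus $\tilde\gamma$ is a geodesic, and so $\tilde\gamma = \gamma_{cV}$ as claimed. □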
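The rescaling property can be checked by hand in a case where geodesics are known in closed form. The sketch below assumes the standard fact, not yet established at this point in the text, that the geodesics of the unit sphere $S^n \subset \mathbb{R}^{n+1}$ are great circles traversed at constant speed; $p$ is a point of the sphere, $V \ne 0$ is tangent at $p$, and $c > 0$.

```latex
% Assumed closed form of the geodesic with initial point p and velocity V on S^n:
\[
  \gamma_V(t) = \cos(|V|\,t)\,p + \sin(|V|\,t)\,\frac{V}{|V|} .
\]
% Replacing V by cV (c > 0) scales the speed but not the unit direction:
\[
  \gamma_{cV}(t)
  = \cos(c|V|\,t)\,p + \sin(c|V|\,t)\,\frac{cV}{c|V|}
  = \cos(|V|\,ct)\,p + \sin(|V|\,ct)\,\frac{V}{|V|}
  = \gamma_V(ct) .
\]
```

The same great circle is traced in both cases; multiplying the initial velocity by $c$ only changes how fast it is traversed.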

Proof of Proposition 5.7. The rescaling lemma with $t = 1$ says precisely
that $\exp(cV) = \gamma_{cV}(1) = \gamma_V(c)$ whenever either side is defined; this is (b).
Moreover, if $V \in \mathcal{E}_p$, by definition $\gamma_V$ is defined at least on $[0,1]$. Thus for
$0 \le t \le 1$, the rescaling lemma says that

$$\exp(tV) = \gamma_{tV}(1) = \gamma_V(t)$$

is defined. This shows that $\mathcal{E}_p$ is star-shaped.

It remains to show that $\mathcal{E}$ is open and $\exp$ is smooth. To do so, we revisit
the proof of the existence and uniqueness theorem for geodesics (Theorem
4.10) and reformulate it in a more invariant way. Let $(x^i)$ be any local
coordinates on an open set $U \subset M$, and let $(x^i, v^i)$ denote the standard
coordinates for $\pi^{-1}(U) \subset TM$ constructed after the proof of Lemma 2.2.
Define a vector field $G$ on $\pi^{-1}(U)$ by

$$G_{(x,v)} = v^k\,\frac{\partial}{\partial x^k} - v^i v^j\,\Gamma^k_{ij}(x)\,\frac{\partial}{\partial v^k}. \tag{5.6}$$

The integral curves of $G$ satisfy the system of ODEs

$$\begin{aligned}
\dot x^k(t) &= v^k(t), \\
\dot v^k(t) &= -v^i(t)\,v^j(t)\,\Gamma^k_{ij}(x(t)).
\end{aligned} \tag{5.7}$$

This is exactly the first-order system equivalent to the geodesic equation
under the substitution $v^k = \dot x^k$, as we observed in the proof of Theorem
4.10. Stated somewhat more invariantly, the integral curves of $G$ on $\pi^{-1}(U)$
project to geodesics under the projection $\pi: TM \to M$ (which in these
coordinates is just $\pi(x(t), v(t)) = x(t)$); conversely, any geodesic $\gamma(t) =
(x^1(t),\dots,x^n(t))$ lifts to an integral curve of $G$ by setting $v^i(t) = \dot x^i(t)$.
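The first-order system for the integral curves of $G$ is also the natural form for numerical work. As a hedged sketch, the code below integrates it with a classical Runge-Kutta scheme on the upper half-plane with the metric $(dx^2 + dy^2)/y^2$. The choice of this metric is an assumption made for illustration; its nonzero Christoffel symbols, which follow from (5.4), are $\Gamma^x_{xy} = \Gamma^x_{yx} = -1/y$, $\Gamma^y_{xx} = 1/y$, and $\Gamma^y_{yy} = -1/y$. The function names are hypothetical.

```python
import math

def geodesic_field(state):
    """Components of G at (x, y, vx, vy) for the metric (dx^2 + dy^2) / y^2."""
    x, y, vx, vy = state
    # x' = vx, y' = vy, and v' = -Gamma(v, v) written out for this metric.
    return (vx, vy, 2.0 * vx * vy / y, (vy * vy - vx * vx) / y)

def flow(state, t_end, steps=1000):
    """Approximate the integral curve of G through `state` up to time t_end (RK4)."""
    h = t_end / steps
    s = tuple(state)
    for _ in range(steps):
        k1 = geodesic_field(s)
        k2 = geodesic_field(tuple(a + 0.5 * h * b for a, b in zip(s, k1)))
        k3 = geodesic_field(tuple(a + 0.5 * h * b for a, b in zip(s, k2)))
        k4 = geodesic_field(tuple(a + h * b for a, b in zip(s, k3)))
        s = tuple(a + h / 6.0 * (b1 + 2.0 * b2 + 2.0 * b3 + b4)
                  for a, b1, b2, b3, b4 in zip(s, k1, k2, k3, k4))
    return s

def speed(state):
    """Length of the velocity (vx, vy) with respect to the metric at (x, y)."""
    _, y, vx, vy = state
    return math.sqrt(vx * vx + vy * vy) / y

# Initial point p = (0, 1) and unit initial velocity V = (1, 0).
start = (0.0, 1.0, 1.0, 0.0)
end = flow(start, 1.0)
```

Projecting `end` to its first two components gives an approximation of the point reached by following the geodesic for time 1. For this initial data the curve $(\tanh t, \operatorname{sech} t)$ solves the system exactly, so the result can be compared with $(\tanh 1, \operatorname{sech} 1)$, and `speed(end)` remains 1 up to integration error, in line with Lemma 5.5.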

The importance of $G$ stems from the fact that it actually extends to a
global vector field on the total space of the tangent bundle $TM$, called the
geodesic vector field. The key observation, to be proved below, is that for
any $f \in C^\infty(TM)$, $G$ acts on $f$ by

$$Gf(p, V) = \frac{d}{dt}\bigg|_{t=0} f\big(\gamma_V(t), \dot\gamma_V(t)\big). \tag{5.8}$$

(Here and whenever convenient, we use the notations $(p, V)$ and $V$ interchangeably
for an element $V \in T_pM$, depending on whether we wish to
emphasize the point at which $V$ is tangent.) Since this formula is independent
of coordinates, it shows that the various definitions of $G$ given by (5.6)
in different coordinate systems agree.

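To prove that $G$ satisfies (5.8), we write the components of the geodesic
$\gamma_V(t)$ as $x^i(t)$ and those of its velocity vector field as $v^i(t) = \dot x^i(t)$. Using
the chain rule and the geodesic equation in the form (5.7), the right-hand
side of (5.8) becomes

$$\begin{aligned}
&\Big(\frac{\partial f}{\partial x^k}(x(t), v(t))\,\dot x^k(t) + \frac{\partial f}{\partial v^k}(x(t), v(t))\,\dot v^k(t)\Big)\bigg|_{t=0} \\
&\qquad = \frac{\partial f}{\partial x^k}(p, V)\,V^k - \frac{\partial f}{\partial v^k}(p, V)\,V^i V^j\,\Gamma^k_{ij}(p) \\
&\qquad = Gf(p, V).
\end{aligned}$$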

The standard results on global flows of vector fields [Boo86, Theorems
IV.4.3 and IV.4.5] show that there is an open neighborhood $\mathcal{O}$ of $\{0\} \times TM$
in $\mathbb{R} \times TM$ and a smooth map $\theta: \mathcal{O} \to TM$ such that each curve $\theta_{(p,V)}(t) =
\theta(t, (p, V))$ is the integral curve of $G$ starting at $(p, V)$, defined on an open
interval containing 0.

Now suppose $(p, V) \in \mathcal{E}$. This means that the geodesic $\gamma_V$ is defined at
least on the interval $[0,1]$, and therefore so is the integral curve of $G$ starting
at $(p, V) \in TM$. Since $(1, (p, V)) \in \mathcal{O}$, there is an open neighborhood of
$(1, (p, V))$ in $\mathbb{R} \times TM$ on which the flow of $G$ is defined (Figure 5.2). In
particular, this means there is an open neighborhood of $(p, V)$ on which
the flow exists for $t \in [0,1]$, and therefore on which the exponential map is
defined. This shows that $\mathcal{E}$ is open.

Finally, since geodesics are projections of integral curves of $G$, it follows
that the exponential map can be expressed as

$$\exp(V) = \gamma_V(1) = \pi \circ \theta(1, (p, V))$$

wherever it is defined, and therefore $\exp$ is smooth. □

The naturality of the Riemannian connection (Proposition 5.6) and 
uniqueness of geodesics translate into the following important naturality 
property of the exponential map: 












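Proposition 5.9. (Naturality of the Exponential Map) Suppose that
$\varphi: (M,g) \to (\tilde M, \tilde g)$ is an isometry. Then, for any $p \in M$, the following
diagram commutes:

$$\begin{array}{ccc}
T_pM & \xrightarrow{\ \varphi_*\ } & T_{\varphi(p)}\tilde M \\
\Big\downarrow{\scriptstyle \exp_p} & & \Big\downarrow{\scriptstyle \exp_{\varphi(p)}} \\
M & \xrightarrow{\ \ \varphi\ \ } & \tilde M
\end{array}$$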


Exercise 5.5. Prove Proposition 5.9. 


Normal Neighborhoods and Normal Coordinates 


Recall that for any $p \in M$, the restricted exponential map $\exp_p$ maps the
open subset $\mathcal{E}_p$ of the tangent space $T_pM$ into $M$.

Lemma 5.10. (Normal Neighborhood Lemma) For any $p \in M$, there
is a neighborhood $\mathcal{V}$ of the origin in $T_pM$ and a neighborhood $\mathcal{U}$ of $p$ in $M$
such that $\exp_p: \mathcal{V} \to \mathcal{U}$ is a diffeomorphism.

Proof. This follows immediately from the inverse function theorem, once
we show that $(\exp_p)_*$ is invertible at 0. Since $T_pM$ is a vector space, there
is a natural identification $T_0(T_pM) = T_pM$. Under this identification, we
will show that $(\exp_p)_*: T_0(T_pM) = T_pM \to T_pM$ has a particularly simple
expression: it is the identity map!

To compute $(\exp_p)_*V$ for an arbitrary vector $V \in T_pM$, we just need to
choose a curve $\tau$ in $T_pM$ starting at 0 whose initial tangent vector is $V$,
and compute the initial tangent vector of the composite curve $\exp_p \circ \tau(t)$.
An obvious such curve is $\tau(t) = tV$. Thus

$$(\exp_p)_*V = \frac{d}{dt}\bigg|_{t=0}(\exp_p \circ \tau)(t)
= \frac{d}{dt}\bigg|_{t=0}\exp_p(tV)
= \frac{d}{dt}\bigg|_{t=0}\gamma_V(t) = V.$$

□
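The computation in this proof can be made explicit in a case where the exponential map has a closed form. The sketch below assumes, as a standard fact not derived at this point in the text, that geodesics of the unit sphere $S^n \subset \mathbb{R}^{n+1}$ are great circles; $p \in S^n$ and $V \ne 0$ is tangent at $p$.

```latex
% Assumed closed form of the restricted exponential map on the unit sphere:
\[
  \exp_p(tV) = \gamma_V(t) = \cos(t|V|)\,p + \sin(t|V|)\,\frac{V}{|V|} .
\]
% Differentiating at t = 0 recovers V, so (exp_p)_* is the identity at 0:
\[
  \frac{d}{dt}\bigg|_{t=0} \exp_p(tV)
  = -|V|\sin(0)\,p + |V|\cos(0)\,\frac{V}{|V|}
  = V .
\]
```

On the sphere $\exp_p$ is a diffeomorphism only on balls of radius less than $\pi$ in $T_pM$, which illustrates why the lemma is a local statement.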

Any open neighborhood $\mathcal{U}$ of $p \in M$ that is the diffeomorphic image
under $\exp_p$ of a star-shaped open neighborhood of $0 \in T_pM$ as in the
preceding lemma is called a normal neighborhood of $p$. If $\varepsilon > 0$ is such that
$\exp_p$ is a diffeomorphism on the ball $B_\varepsilon(0) \subset T_pM$ (where the radius of the
ball is measured with respect to the norm defined by $g$), then the image
set $\exp_p(B_\varepsilon(0))$ is called a geodesic ball in $M$. Also, if the closed ball $\bar B_\varepsilon(0)$
is contained in an open set $\mathcal{V} \subset T_pM$ on which $\exp_p$ is a diffeomorphism,
then $\exp_p(\bar B_\varepsilon(0))$ is called a closed geodesic ball, and $\exp_p(\partial B_\varepsilon(0))$ is called
a geodesic sphere.

FIGURE 5.3. Riemannian normal coordinates.


An orthonormal basis $\{E_i\}$ for $T_pM$ gives an isomorphism $E: \mathbb{R}^n \to
T_pM$ by $E(x^1,\dots,x^n) = x^i E_i$. If $\mathcal{U}$ is a normal neighborhood of $p$, we can
combine this isomorphism with the exponential map to get a coordinate
chart

$$\varphi := E^{-1} \circ \exp_p^{-1}: \mathcal{U} \to \mathbb{R}^n.$$

Any such coordinates are called (Riemannian) normal coordinates centered
at $p$. Given $p \in M$ and a normal neighborhood $\mathcal{U}$ of $p$, there is a one-to-one
correspondence between normal coordinate charts and orthonormal bases
at $p$.

In any normal coordinate chart centered at $p$, define the radial distance
function $r$ by

$$r(x) := \Big(\sum_i (x^i)^2\Big)^{1/2}, \tag{5.9}$$

and the unit radial vector field $\partial/\partial r$ by

$$\frac{\partial}{\partial r} := \frac{x^i}{r}\,\frac{\partial}{\partial x^i}. \tag{5.10}$$

(See Figure 5.3.) In Euclidean space, $r(x)$ is the distance to the origin, and
$\partial/\partial r$ is the unit vector field tangent to straight lines through the origin.
As the next proposition shows, they also have special geometric meaning
for any metric in normal coordinates. (We will strengthen these results
considerably in the next chapter.)

Proposition 5.11. (Properties of Normal Coordinates) Let $(\mathcal{U}, (x^i))$
be any normal coordinate chart centered at $p$.

(a) For any $V = V^i\partial_i \in T_pM$, the geodesic $\gamma_V$ starting at $p$ with initial
velocity vector $V$ is represented in normal coordinates by the radial
line segment

$$\gamma_V(t) = (tV^1,\dots,tV^n) \tag{5.11}$$

as long as $\gamma_V$ stays within $\mathcal{U}$.

(b) The coordinates of $p$ are $(0,\dots,0)$.

(c) The components of the metric at $p$ are $g_{ij} = \delta_{ij}$.

(d) Any Euclidean ball $\{x : r(x) < \varepsilon\}$ contained in $\mathcal{U}$ is a geodesic ball in
$M$.

(e) At any point $q \in \mathcal{U} - p$, $\partial/\partial r$ is the velocity vector of the unit speed
geodesic from $p$ to $q$, and therefore has unit length with respect to $g$.

(f) The first partial derivatives of $g_{ij}$ and the Christoffel symbols vanish
at $p$.

Normal coordinates are a vital tool for calculations in Riemannian geometry,
so you should make sure you thoroughly understand the properties
expressed in the preceding proposition. The proofs are all straightforward
consequences of the fact that geodesics starting at $p$ have the simple formula
(5.11) in normal coordinates. Because of this formula, the geodesics
starting at $p$ and lying in a normal neighborhood of $p$ are called radial geodesics.
(But be warned that geodesics that do not pass through $p$ do not
in general have a simple form in normal coordinates.)

Exercise 5.6. Prove Proposition 5.11. 
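To indicate how formula (5.11) is used, here is one possible outline of the argument for part (f) only, offered as a sketch rather than as the intended solution. It combines (5.11) with the geodesic equation, the symmetry of the Riemannian connection, and the coordinate expression for $\nabla g = 0$ from the proof of Theorem 5.4.

```latex
% Radial lines gamma_V(t) = (tV^1, ..., tV^n) are geodesics, and their second
% derivatives vanish, so the geodesic equation reduces to
\[
  \Gamma^k_{ij}(tV)\,V^i V^j = 0 .
\]
% Setting t = 0 gives a quadratic form in V that vanishes for every V in T_pM:
\[
  \Gamma^k_{ij}(p)\,V^i V^j = 0 \quad\text{for all } V
  \quad\Longrightarrow\quad
  \Gamma^k_{ij}(p) = 0 ,
\]
% using the symmetry Gamma^k_{ij} = Gamma^k_{ji}. Compatibility then yields
\[
  \partial_k g_{ij}(p) = \Gamma^l_{ki}(p)\,g_{lj}(p) + \Gamma^l_{kj}(p)\,g_{il}(p) = 0 .
\]
```

Note that this only controls the Christoffel symbols at the single point $p$; along a radial geodesic away from $p$ the first display constrains just the contraction with $V^iV^j$.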

For later use in studying minimizing properties of geodesics, we need the
following refinement of the concept of normal neighborhoods. An open set
$\mathcal{W} \subset M$ is called uniformly normal if there exists some $\delta > 0$ such that $\mathcal{W}$
is contained in a geodesic ball of radius $\delta$ around each of its points (Figure
5.4).

The proof of the next lemma is fairly technical, though not really hard.
You might wish to read the statement now, and come back to the proof
later.

Lemma 5.12. (Uniformly Normal Neighborhood Lemma) Given
$p \in M$ and any neighborhood $\mathcal{U}$ of $p$, there exists a uniformly normal neighborhood
$\mathcal{W}$ of $p$ contained in $\mathcal{U}$.

FIGURE 5.4. Uniformly normal neighborhood.


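Proof. Recall that the exponential map is defined on an open subset $\mathcal{E}$ of
$TM$. Define a new map $F: \mathcal{E} \to M \times M$ by

$$F(q, V) = (q, \exp_q V).$$

Choose a normal coordinate chart $(x^i)$ for $M$ centered at $p$, and let $(x^i, v^i)$
denote the corresponding standard coordinates on $TM$. In these coordinates,
the Jacobian matrix of $F$ at $(p, 0)$ can be written as

$$F_* = \begin{pmatrix}
\dfrac{\partial x^i}{\partial x^j} & \dfrac{\partial x^i}{\partial v^j} \\[2ex]
\dfrac{\partial \exp^i}{\partial x^j} & \dfrac{\partial \exp^i}{\partial v^j}
\end{pmatrix}
= \begin{pmatrix} \mathrm{Id} & 0 \\ * & \mathrm{Id} \end{pmatrix},$$

which is invertible. (The lower right block is the identity because
$(\exp_p)_*$ is the identity at 0, as shown in the proof of Lemma 5.10.) Thus,
by the inverse function theorem, $F$ is a diffeomorphism
from some neighborhood $\mathcal{O}$ of $(p, 0)$ in $TM$ to its image (Figure
5.5).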

For any open set $\mathcal{Y} \subset M$ and any $\delta > 0$, let $\mathcal{Y}_\delta$ denote the subset of $TM$
given by

$$\mathcal{Y}_\delta = \{(q, V) \in TM : q \in \mathcal{Y},\ |V| < \delta\},$$

where as usual $|\cdot|$ denotes the norm given by $g$. By writing the inequality
$|V| < \delta$ in any standard coordinates, it is easy to see that $\mathcal{Y}_\delta$ is open in
the topology of $TM$. We will show that there is some set of the form $\mathcal{Y}_\delta$
such that $(p, 0) \in \mathcal{Y}_\delta \subset \mathcal{O}$. Since the topology on $TM$ is generated by
product open sets in local trivializations, there exists $\varepsilon > 0$ such that the
set $X = \{(x, v) : r(x) < 2\varepsilon,\ |v|_{\bar g} < 2\varepsilon\}$ is contained in $\mathcal{O}$ (Figure 5.6), where
$|\cdot|_{\bar g}$ is the Euclidean norm in these coordinates. On the compact set
$K = \{(x, v) : r(x) \le \varepsilon,\ |v|_{\bar g} = \varepsilon\}$, the $g$-norm is continuous and nonvanishing,
and therefore is bounded above and below by positive constants. Since
both norms are homogeneous in the sense that $|\lambda v| = \lambda |v|$ for any positive
constant $\lambda$, it follows that $c|v|_{\bar g} \le |v|_g \le C|v|_{\bar g}$ whenever $v \in T_xM$,
$r(x) \le \varepsilon$.

FIGURE 5.5. $F$ is a diffeomorphism near $(p, 0)$.

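Now let $\mathcal{Y}$ be the geodesic ball $\mathcal{Y} = \{x : r(x) < \varepsilon\} \subset M$, and let $\delta = c\varepsilon$.
Whenever $(x, v) \in \mathcal{Y}_\delta$, our choices guarantee that $|v|_{\bar g} \le (1/c)|v|_g < \varepsilon$, so
$\mathcal{Y}_\delta \subset X \subset \mathcal{O}$.

Since $F$ is a diffeomorphism on $\mathcal{Y}_\delta$ and takes $(p, 0)$ to $(p, p)$, there is a
product open set $\mathcal{W} \times \mathcal{W} \subset M \times M$ such that $(p, p) \in \mathcal{W} \times \mathcal{W} \subset F(\mathcal{Y}_\delta)$.
Shrinking $\mathcal{W}$ if necessary, we may also assume that $\mathcal{W} \subset \mathcal{Y}$. We make two
claims about the set $\mathcal{W}$: for any $q \in \mathcal{W}$, (1) $\exp_q$ is a diffeomorphism on
$B_\delta(0) \subset T_qM$; and (2) $\mathcal{W} \subset \exp_q(B_\delta(0))$. It follows from these claims that
$\mathcal{W}$ is the required uniformly normal neighborhood of $p$.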

To prove claim (1), observe first that for each $q \in \mathcal{W}$, $\exp_q$ is at least
defined on $B_\delta(0) \subset T_qM$: since $F$ is defined on the set $\mathcal{Y}_\delta$, $F(q, V) = (q, \exp_q V)$ is
defined whenever $|V|_g < \delta$. Because $F$ has the form $F(q, V) = (q, \exp_q V)$,
its inverse has the similar form $F^{-1}(q, y) = (q, \varphi(q, y))$ for some smooth
map $\varphi$. Let’s use the notation $\varphi_q(y) = \varphi(q, y)$. Then, because $F^{-1} \circ F$ is
the identity on $\mathcal{Y}_\delta$, it follows that $\varphi_q \circ \exp_q$ is the identity on $B_\delta(0) \subset T_qM$
for each $q \in \mathcal{W} \subset \mathcal{Y}$. Similarly, $F \circ F^{-1} = \mathrm{Id}$ on $F(\mathcal{Y}_\delta)$ implies that $\exp_q \circ \varphi_q$
is the identity on $\exp_q(B_\delta(0))$, so claim (1) is proved.

Finally, we turn to claim (2). Let $(q, y) \in \mathcal{W} \times \mathcal{W}$ be arbitrary. Since
$\mathcal{W} \times \mathcal{W} \subset F(\mathcal{Y}_\delta)$, there is some $V \in B_\delta(0) \subset T_qM$ such that $(q, y) =
F(q, V) = (q, \exp_q V)$. This says precisely that $y = \exp_q V$, which was to
be proved. □

FIGURE 5.6. The sets $\mathcal{Y}_\delta \subset X \subset \mathcal{O}$.


Geodesics of the Model Spaces 

In this section we determine the geodesics of our three classes of model 
Riemannian manifolds defined in Chapter 3. 


Euclidean Space 

On $\mathbb{R}^n$ with the Euclidean metric $\bar g$, the metric coefficients are constants in
the standard coordinate system, so it follows immediately from (5.4) that
the Christoffel symbols are all zero in these coordinates. This means that
the Riemannian connection on Euclidean space is exactly the Euclidean
connection (4.4). Therefore, as one would expect, the Euclidean geodesics
are straight lines, and constant-coefficient vector fields are parallel
(Exercises 4.8 and 4.9).

Spheres 

On the 2-sphere $\mathbb{S}^2_R$ of radius $R$, it is not terribly difficult to compute the
Christoffel symbols directly in, say, spherical coordinates, and to show that
the meridians (lines of constant longitude) are geodesics, as in the following
exercise.

Exercise 5.7. Define spherical coordinates $(\theta, \varphi)$ on the subset
$\mathbb{S}^2_R - \{(x, y, z) : x \le 0,\ y = 0\}$ of the sphere by

$$(x, y, z) = (R\sin\varphi\cos\theta,\ R\sin\varphi\sin\theta,\ R\cos\varphi), \qquad -\pi < \theta < \pi,\quad 0 < \varphi < \pi.$$

(These are a special case of the coordinates for surfaces of revolution
constructed in Exercise 3.3.)

(a) Show that the round metric of radius $R$ is $\mathring g_R = R^2\, d\varphi^2 + R^2 \sin^2\varphi\, d\theta^2$
in spherical coordinates.

(b) Compute the Christoffel symbols of $\mathring g_R$ in spherical coordinates.

(c) Using the geodesic equation (4.11) in spherical coordinates, verify that
each meridian $(\theta(t), \varphi(t)) = (\theta_0, t)$ is a geodesic.

As you can see from doing this exercise, even in a simple case like this, 
verifying the geodesic equation directly can involve a rather large number 
of calculations. When the metric is more complicated or the number of 
dimensions is high (or even when we attempt to identify arbitrary geodesics 
on the 2-sphere instead of just the “vertical” ones), this direct approach can
become prohibitively difficult, so we must often look for other techniques 
to analyze geodesics. 
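
The direct verification asked for in Exercise 5.7 can also be checked numerically. The following is a minimal sketch, not part of the text: it assumes the round metric $R^2\,d\varphi^2 + R^2\sin^2\varphi\,d\theta^2$ from part (a), orders the coordinates as $(\theta, \varphi)$, and approximates the derivatives of the metric by central differences. The function names and the sample point are illustrative choices.

```python
import math

R = 2.0  # radius of the sphere (arbitrary illustrative value)

def metric(x):
    """Round metric in coordinates x = (theta, phi): diag(R^2 sin^2 phi, R^2)."""
    theta, phi = x
    return [[R**2 * math.sin(phi)**2, 0.0],
            [0.0, R**2]]

def christoffel(x, h=1e-5):
    """Return gamma[k][i][j] = Gamma^k_{ij}, using central differences of the metric."""
    n = 2
    dg = []  # dg[l][i][j] approximates the derivative of g_ij with respect to x^l
    for l in range(n):
        xp, xm = list(x), list(x)
        xp[l] += h
        xm[l] -= h
        gp, gm = metric(xp), metric(xm)
        dg.append([[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(n)]
                   for i in range(n)])
    g = metric(x)
    # The metric is diagonal in these coordinates, so its inverse is too.
    ginv = [[1.0 / g[0][0], 0.0], [0.0, 1.0 / g[1][1]]]
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                gamma[k][i][j] = 0.5 * sum(
                    ginv[k][l] * (dg[i][j][l] + dg[j][i][l] - dg[l][i][j])
                    for l in range(n))
    return gamma

def geodesic_residual(x, v, a):
    """Left-hand side of the geodesic equation at position x, velocity v, acceleration a."""
    gamma = christoffel(x)
    return [a[k] + sum(gamma[k][i][j] * v[i] * v[j]
                       for i in range(2) for j in range(2))
            for k in range(2)]

# Meridian (theta, phi) = (theta_0, t): velocity (0, 1), zero coordinate acceleration.
meridian = geodesic_residual((0.3, 1.1), (0.0, 1.0), (0.0, 0.0))
# Latitude circle (theta, phi) = (t, phi_0): velocity (1, 0), zero coordinate acceleration.
latitude = geodesic_residual((0.3, 1.1), (1.0, 0.0), (0.0, 0.0))
print(meridian, latitude)
```

The meridian residual vanishes up to discretization error, while the latitude circle at $\varphi_0 = 1.1$ leaves a nonzero $\varphi$-component, so it is not a geodesic.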

Fortunately, the fact that the sphere is homogeneous and isotropic gives 
us a much easier way to determine the geodesics in all dimensions. 

Proposition 5.13. The geodesics on $\mathbb{S}^n_R$ are precisely the “great circles”
(intersections of $\mathbb{S}^n_R$ with 2-planes through the origin), with constant speed
parametrizations.

Proof. First we consider a geodesic $\gamma(t) = (x^1(t), \dots, x^{n+1}(t))$ starting
at the north pole $N$ whose initial velocity $V$ is a multiple of $\partial_1$. It is
intuitively evident by symmetry that this geodesic must remain along the
meridian $x^2 = \cdots = x^n = 0$. To make this intuition rigorous, suppose
not; that is, suppose there were a time $t_0$ such that $x^i(t_0) \ne 0$ for some
$2 \le i \le n$. The linear map $\varphi\colon \mathbb{R}^{n+1} \to \mathbb{R}^{n+1}$ sending $x^i$ to $-x^i$ and leaving
the other coordinates fixed is an isometry of the sphere that fixes $N = \gamma(0)$
and $V = \dot\gamma(0)$, and therefore, by uniqueness of geodesics, it takes $\gamma$ to $\gamma$.
But $\varphi(\gamma(t_0)) \ne \gamma(t_0)$, a contradiction (Figure 5.7).

Since geodesics have constant speed, the geodesic with initial point $N$
and initial velocity $c\,\partial_1$ must therefore be the circle where $\mathbb{S}^n_R$ intersects the
$(x^1, x^{n+1})$-plane, with a constant speed parametrization. Since there is an
orthogonal map taking any other initial point to $N$ and any other initial
vector to one of this form, and since orthogonal maps take planes through
the origin to planes through the origin, it follows that the geodesics on $\mathbb{S}^n_R$
are precisely the intersections of $\mathbb{S}^n_R$ with 2-planes through the origin. □
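
As a numerical sanity check of Proposition 5.13 on the 2-sphere, the sketch below relies on a standard fact that is not restated in this passage and should be read as an assumption here: for a submanifold of Euclidean space with the induced metric, the covariant acceleration of a curve is the tangential projection of its ordinary acceleration. The curve names, sample time, and step size are illustrative.

```python
import math

R, c = 2.0, 1.5  # radius and speed (arbitrary illustrative values)

def great_circle(t):
    """Constant speed c curve through N = (0, 0, R) in the (x^1, x^3)-plane."""
    return (R * math.sin(c * t / R), 0.0, R * math.cos(c * t / R))

def latitude(t, phi0=1.0):
    """Circle of constant colatitude phi0; not a great circle unless phi0 = pi/2."""
    return (R * math.sin(phi0) * math.cos(t),
            R * math.sin(phi0) * math.sin(t),
            R * math.cos(phi0))

def tangential_acceleration(curve, t, h=1e-4):
    """Second difference quotient of the curve, projected onto the tangent plane."""
    p = curve(t)
    plus, minus = curve(t + h), curve(t - h)
    acc = [(plus[i] - 2.0 * p[i] + minus[i]) / h**2 for i in range(3)]
    # Remove the component along the position vector p, which is normal to the sphere.
    radial = sum(acc[i] * p[i] for i in range(3)) / R**2
    return [acc[i] - radial * p[i] for i in range(3)]

tan_great = tangential_acceleration(great_circle, 0.8)
tan_lat = tangential_acceleration(latitude, 0.8)
print(tan_great, tan_lat)
```

The great circle has vanishing tangential acceleration up to discretization error, whereas the latitude circle does not, in agreement with the proposition.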


Hyperbolic Spaces 

The geodesics of $\mathbb{H}^n_R$ are easily determined using homogeneity and isotropy,
as in the case of the sphere.

Proposition 5.14. The geodesics on the hyperbolic spaces are the following
curves, with constant speed parametrizations:

Hyperboloid model: The “great hyperbolas,” or intersections of $\mathbb{H}^n_R$
with 2-planes through the origin.

Ball model: The line segments through the origin and the circular arcs
that intersect $\partial\mathbb{B}^n_R$ orthogonally (Figure 5.8).

Half-space model: The vertical half-lines and the semicircles with centers
on the $y = 0$ hyperplane (Figure 5.9).

FIGURE 5.8. Geodesics of the Poincaré ball.

FIGURE 5.9. Geodesics of the Poincaré half-space.

Proof. We begin with the hyperboloid model. As with the sphere, the geodesic
starting at $N$ with initial tangent vector parallel to $\partial/\partial\xi^1$ must remain
in the $(\xi^1, \tau)$-plane by symmetry (Figure 5.10), and therefore must be a
constant speed parametrization of the hyperbola where this plane intersects
$\mathbb{H}^n_R$. Since $\mathbb{H}^n_R$ is homogeneous and isotropic, and $O_+(n, 1)$ takes 2-planes
through the origin to 2-planes through the origin, the result follows.

FIGURE 5.10. A great hyperbola.

For the ball model, first consider the 2-dimensional case, and recall the
hyperbolic stereographic projection $\pi\colon \mathbb{H}^2_R \to \mathbb{B}^2_R$ constructed in Chapter 3:

$$\pi(\xi, \tau) = u = \frac{R\xi}{R + \tau}, \qquad \pi^{-1}(u) = (\xi, \tau) = \left(\frac{2R^2 u}{R^2 - |u|^2},\ R\,\frac{R^2 + |u|^2}{R^2 - |u|^2}\right).$$

A geodesic in the hyperboloid model is the set of points on $\mathbb{H}^2_R$ that solve
a linear equation $\alpha_i \xi^i + \beta\tau = 0$, with a constant speed parametrization.
In the special case $\beta = 0$, this hyperbola is mapped by $\pi$ to a straight
line segment through the origin, as can easily be seen from the geometric
definition of $\pi$. If $\beta \ne 0$, we can divide through by $-\beta$ and write the linear
equation as $\tau = \alpha_i \xi^i = \langle \alpha, \xi \rangle$ (for a different covector $\alpha$). Under $\pi$ this
pulls back to the equation

$$R\,\frac{R^2 + |u|^2}{R^2 - |u|^2} = \frac{2R^2 \langle \alpha, u \rangle}{R^2 - |u|^2}$$

on the disk, which simplifies to

$$|u|^2 - 2R\langle \alpha, u \rangle + R^2 = 0.$$

Completing the square, we can write this as

$$|u - R\alpha|^2 = R^2(|\alpha|^2 - 1). \tag{5.12}$$

If $|\alpha|^2 \le 1$ this locus is either empty or a point on $\partial\mathbb{B}^2_R$, so it does not
define a geodesic. When $|\alpha|^2 > 1$, this is the circle with center $R\alpha$ and
radius $R\sqrt{|\alpha|^2 - 1}$. At a point $u_0$ where the circle intersects $\partial\mathbb{B}^2_R$, the
three points $0$, $u_0$, and $R\alpha$ form a triangle with sides $|u_0| = R$, $|R\alpha|$,
and $|u_0 - R\alpha|$ (Figure 5.11), which satisfy the Pythagorean identity by
(5.12); therefore the circle meets $\partial\mathbb{B}^2_R$ in a right angle. By the existence
and uniqueness theorem, it is easy to see that the line segments through
the origin and the circular arcs that intersect $\partial\mathbb{B}^2_R$ orthogonally are all the
geodesics.
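
The orthogonality claim derived from (5.12) is easy to test numerically. The following minimal sketch picks an arbitrary $\alpha$ with $|\alpha| > 1$ (the specific numbers are illustrative assumptions), locates a point $u_0$ where the circle with center $R\alpha$ and radius $R\sqrt{|\alpha|^2 - 1}$ meets the boundary circle $|u| = R$, and checks that the two radii drawn to $u_0$ are perpendicular, which is equivalent to the circles meeting at a right angle.

```python
import math

R = 1.5
alpha = (1.2, -0.9)              # illustrative choice with |alpha| = 1.5 > 1
norm_a = math.hypot(*alpha)

center = (R * alpha[0], R * alpha[1])
radius = R * math.sqrt(norm_a**2 - 1.0)

# On both circles we have |u|^2 = R^2 and |u|^2 - 2R<alpha, u> + R^2 = 0,
# hence <alpha, u> = R. Solve this in an orthonormal basis adapted to alpha.
x = R / norm_a
y = math.sqrt(R**2 - x**2)
a_hat = (alpha[0] / norm_a, alpha[1] / norm_a)
a_perp = (-a_hat[1], a_hat[0])
u0 = (x * a_hat[0] + y * a_perp[0], x * a_hat[1] + y * a_perp[1])

dist_to_origin = math.hypot(*u0)
dist_to_center = math.hypot(u0[0] - center[0], u0[1] - center[1])
# Inner product of the radius of the ball at u0 with the radius of the circle at u0.
dot = u0[0] * (u0[0] - center[0]) + u0[1] * (u0[1] - center[1])
print(dist_to_origin, dist_to_center, dot)
```

The first two printed values agree with $R$ and with the radius of the circle, and the inner product vanishes up to rounding.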



In the higher-dimensional case, a geodesic on $\mathbb{H}^n_R$ is determined by a
2-plane. If the 2-plane contains the point $N$, the corresponding geodesic on
$\mathbb{B}^n_R$ is a line through the origin as before. Otherwise, we can conjugate with
an orthogonal transformation in the $(\xi^1, \dots, \xi^n)$ variables (which preserves
$h_R$) to move this 2-plane so that it lies in the $(\xi^1, \xi^2, \tau)$ subspace, and
then we are in the same situation as in the 2-dimensional case.

Now consider the upper half-space model. The 2-dimensional case is easiest
to analyze using complex notation. It is straightforward to check that
the inverse of the complex Cayley transform (3.12) is

$$\kappa^{-1}(z) = w = iR\,\frac{z - iR}{z + iR}.$$

Substituting this into equation (5.12) and writing $w = u + iv$ and $\alpha = a + ib$
in place of $u = (u^1, u^2)$, $\alpha = (\alpha_1, \alpha_2)$, we get

$$R^2\,\frac{|z - iR|^2}{|z + iR|^2} - iR^2\bar\alpha\,\frac{z - iR}{z + iR} + iR^2\alpha\,\frac{\bar z + iR}{\bar z - iR} + R^2|\alpha|^2 = R^2(|\alpha|^2 - 1).$$

Multiplying through by $(z + iR)(\bar z - iR)/2R^2$ and simplifying, we obtain

$$(1 - b)|z|^2 - 2aRx + (b + 1)R^2 = 0,$$

where $z = x + iy$. This is the equation of a circle with center on the $x$-axis,
unless $b = 1$, in which case the condition $|\alpha|^2 > 1$ forces $a \ne 0$, and then it is
a straight line $x = \text{constant}$. The other class of geodesics on the ball, line
segments through the origin, can be handled similarly.

In the higher-dimensional case, we just conjugate $\kappa$ with a suitable
orthogonal transformation in the first $n - 1$ variables, and apply the usual
symmetry arguments to show that the resulting geodesics remain in the
$(u^1, v)$- and $(x^1, y)$-planes. □
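
The algebra relating (5.12) to the circle equation in the half-plane can be spot-checked with complex arithmetic. The sketch below is illustrative only: it assumes the formula for $\kappa^{-1}$ quoted in the proof, chooses sample values of $R$ and $\alpha = a + ib$ with $|\alpha| > 1$ and $b \ne 1$, takes points $z$ on the circle $(1 - b)|z|^2 - 2aRx + (b + 1)R^2 = 0$ in the upper half-plane, and verifies that their images $w = \kappa^{-1}(z)$ satisfy (5.12) and lie inside the disk of radius $R$.

```python
import cmath
import math

R = 1.0
alpha = complex(2.0, 0.5)   # a + ib, illustrative, with |alpha| > 1 and b != 1
a, b = alpha.real, alpha.imag

def cayley_inverse(z):
    """Inverse Cayley transform as quoted in the proof: half-plane -> disk."""
    return 1j * R * (z - 1j * R) / (z + 1j * R)

# Dividing the circle equation by (1 - b) shows its center is (x_c, 0) and its
# radius is rho, with the values below.
x_c = a * R / (1.0 - b)
rho = R * math.sqrt(abs(alpha)**2 - 1.0) / abs(1.0 - b)

residuals = []
inside = []
for psi in (0.3, 1.0, 2.5):             # angles in (0, pi), so Im z > 0
    z = x_c + rho * cmath.exp(1j * psi)
    w = cayley_inverse(z)
    # Equation (5.12): |w - R alpha|^2 = R^2 (|alpha|^2 - 1)
    residuals.append(abs(w - R * alpha)**2 - R**2 * (abs(alpha)**2 - 1.0))
    inside.append(abs(w) < R)
print(residuals, inside)
```

All residuals vanish up to rounding, which is consistent with the semicircle being the image of a geodesic arc of the ball model.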
Problems

5-1. Let $\nabla$ be a linear connection on a Riemannian manifold $(M, g)$. Show
that $\nabla$ is compatible with $g$ if and only if the connection 1-forms $\omega_i{}^j$
(Problem 4-5) with respect to any local frame $\{E_i\}$ satisfy

$$g_{jk}\,\omega_i{}^k + g_{ik}\,\omega_j{}^k = dg_{ij}.$$

In particular, the matrix $\omega_i{}^j$ of connection 1-forms for the Riemannian
connection with respect to any local orthonormal frame is skew-symmetric.

5-2. Let $M \subset \mathbb{R}^3$ be a surface of revolution, parametrized as in Exercise
3.3. It will simplify the computations if we assume that the curve $\gamma$
(called a generating curve for the surface) is unit speed.

(a) Compute the Christoffel symbols of the induced metric in $(\theta, t)$
coordinates.

(b) Show that each “meridian” $\{\theta = \theta_0\}$ is a geodesic on $M$.

(c) Determine necessary and sufficient conditions for a “latitude
circle” $\{t = t_0\}$ to be a geodesic.

5-3. Let $\mathbb{H}^n_R$ denote the $n$-dimensional hyperbolic space of radius $R$.

(a) Determine the unit speed parametrization of the geodesic in
the hyperboloid model starting at $N = (0, \dots, 0, R)$ with initial
tangent vector $\partial/\partial\xi^1$.

(b) Prove that each geodesic on $\mathbb{H}^n_R$ is defined for all $t \in \mathbb{R}$, and
that the image of each geodesic is an entire branch of a great
hyperbola.

5-4. Recall that a vector field $V$ is said to be parallel if $\nabla V \equiv 0$.

(a) Let $p \in \mathbb{R}^n$ and $V_p \in T_p\mathbb{R}^n$. Show that $V_p$ has a unique extension
to a parallel vector field $V$ on $\mathbb{R}^n$.

(b) Let $U$ be the open subset of the unit sphere $\mathbb{S}^2$ on which spherical
coordinates $(\theta, \varphi)$ are defined (see Exercise 5.7), and let $V =
\partial/\partial\varphi$ in these coordinates. Compute $\nabla_{\partial/\partial\theta} V$ and $\nabla_{\partial/\partial\varphi} V$, and
conclude that $V$ is parallel along the equator and along each
meridian $\theta = \theta_0$.

(c) Let $p = (0, \pi/2)$ in spherical coordinates. Show that $V_p$ has no
parallel extension to any neighborhood of $p$.

(d) Use (a) and (c) to show that no neighborhood of $p$ is isometric
to an open subset of $\mathbb{R}^2$.

5-5. Let $(M, g)$ be a Riemannian manifold. If $f$ is a smooth function on
$M$ such that $|\operatorname{grad} f| \equiv 1$, show that the integral curves of $\operatorname{grad} f$ are
geodesics.

5-6. Let $(M, g)$ be an oriented Riemannian manifold, and div the divergence
operator defined in Problem 3-3.

(a) Show that if $X = X^i E_i$ in terms of some local frame, then $\operatorname{div} X$
can be written in terms of covariant derivatives as

$$\operatorname{div} X = X^i{}_{;i}.$$

[Hint: Show that it suffices to prove the formula at the origin in
normal coordinates.]

(b) Now suppose $M$ is a compact, oriented Riemannian manifold
with boundary. Extend the integration by parts formula of Problem
3-3 as follows: If $\omega$ is any $k$-tensor field and $\eta$ any $(k+1)$-tensor
field,

$$\int_M \langle \nabla\omega, \eta \rangle\, dV = -\int_M \langle \omega, \operatorname{tr}_g \nabla\eta \rangle\, dV + \int_{\partial M} \langle \omega \otimes N, \eta \rangle\, d\tilde V,$$

where the trace is on the last two indices of $\nabla\eta$. This is often
written in the suggestive but not-quite-rigorous notation

$$\int_M \omega_{i_1 \dots i_k ; j}\, \eta^{i_1 \dots i_k j}\, dV = -\int_M \omega_{i_1 \dots i_k}\, \eta^{i_1 \dots i_k j}{}_{;j}\, dV + \int_{\partial M} \omega_{i_1 \dots i_k}\, \eta^{i_1 \dots i_k j} N_j\, d\tilde V.$$

5-7. If $(M, g)$ and $(\tilde M, \tilde g)$ are Riemannian manifolds, a map $\varphi\colon M \to \tilde M$
is a local isometry if each point $p \in M$ has a neighborhood $U$ such
that $\varphi|_U$ is an isometry onto an open subset of $\tilde M$. Suppose $M$ is
connected, and suppose $\varphi, \psi\colon M \to \tilde M$ are local isometries such that
for some point $p \in M$, $\varphi(p) = \psi(p)$ and $\varphi_* = \psi_*$ at $p$. Show that
$\varphi \equiv \psi$.

5-8. Let $E(n)$ be the Euclidean group described in Problem 3-6.

(a) Show that $\mathcal{I}(\mathbb{R}^n) = E(n)$, $\mathcal{I}(\mathbb{H}^n_R) = O_+(n, 1)$, and $\mathcal{I}(\mathbb{S}^n_R) =
O(n + 1)$.

(b) Strengthen the result above by showing that if $M$ is one of our
model Riemannian manifolds ($M = \mathbb{R}^n$, $\mathbb{H}^n_R$, or $\mathbb{S}^n_R$), $U, V$ are
connected open subsets of $M$, and $\varphi\colon U \to V$ is an isometry,
then $\varphi$ is the restriction to $U$ of an element of $\mathcal{I}(M)$.

5-9. Suppose $\pi\colon (\tilde M, \tilde g) \to (M, g)$ is a Riemannian submersion (Problem
3-8). A vector field on $\tilde M$ is said to be horizontal or vertical if its
value is in the horizontal or vertical space at each point, respectively.

(a) For any vector fields $X, Y \in \mathcal{T}(M)$, show that

$$\langle \tilde X, \tilde Y \rangle = \pi^* \langle X, Y \rangle;$$

$$[\tilde X, \tilde Y]^H = \widetilde{[X, Y]};$$

$[\tilde X, W]$ is vertical if $W$ is vertical.

(b) Let $\tilde\nabla$ and $\nabla$ denote the Riemannian connections of $\tilde g$ and $g$,
respectively. For any vector fields $X, Y \in \mathcal{T}(M)$, show that

$$\tilde\nabla_{\tilde X} \tilde Y = \widetilde{\nabla_X Y} + \tfrac{1}{2} [\tilde X, \tilde Y]^V. \tag{5.13}$$

[Hint: Let $\tilde Z$ be a horizontal lift and $W$ be a vertical vector field
on $\tilde M$, and compute $\langle \tilde\nabla_{\tilde X} \tilde Y, \tilde Z \rangle$ and $\langle \tilde\nabla_{\tilde X} \tilde Y, W \rangle$ using formula
(5.1).]

5-10. Suppose $G$ is a Lie group with Lie algebra $\mathfrak{g}$, and let $(X_1, \dots, X_n)$ be
any basis of $\mathfrak{g}$. Define the structure constants $c_{ij}^k$ by

$$[X_i, X_j] = \sum_k c_{ij}^k X_k.$$

For an arbitrary left-invariant metric $g$ on $G$, compute the Christoffel
symbols of the Riemannian connection (with respect to the basis
$\{X_i\}$) in terms of $c_{ij}^k$ and $g_{ij}$.

5-11. Let $G$ be a Lie group and $\mathfrak{g}$ its Lie algebra, and let $g$ be a bi-invariant
metric on $G$ (see Problems 3-10 and 3-12).

(a) For any $X, Y, Z \in \mathfrak{g}$, show that

$$\langle [X, Y], Z \rangle = -\langle Y, [X, Z] \rangle.$$

[Hint: Let $\gamma(t) = \exp(tX)$ (here exp denotes the Lie group
exponential map, not the Riemannian one), and compute the
$t$-derivative of $\langle \mathrm{Ad}_{\gamma(t)} Y, \mathrm{Ad}_{\gamma(t)} Z \rangle$ at $t = 0$, using the facts that
$\mathrm{Ad}_{\gamma(t)} = (R_{\gamma(-t)})_* (L_{\gamma(t)})_*$ and that $R_{\gamma(-t)}$ is the flow of $-X$.]

(b) Show that

$$\nabla_X Y = \tfrac{1}{2} [X, Y]$$

whenever $X$ and $Y$ are left-invariant vector fields on $G$.

(c) Show that the geodesics of $g$ starting at the identity are exactly
the one-parameter subgroups, so the Lie group exponential map
coincides with the Riemannian exponential map at the identity.





6 

Geodesics and Distance 


In this chapter, we study in detail the relationships among geodesics,
lengths, and distances on a Riemannian manifold. A primary goal is to
show that all length-minimizing curves are geodesics, and that all geodesics
are length minimizing, at least locally. A key ingredient in the proofs
is the symmetry of the Riemannian connection. Later in the chapter, we
study the property of geodesic completeness, which means that all maximal
geodesics are defined for all time, and prove the Hopf-Rinow theorem,
which states that a Riemannian manifold is geodesically complete if and
only if it is complete as a metric space.

Throughout this chapter, $M$ is a smooth $n$-manifold endowed with a
fixed Riemannian metric $g$. All covariant derivatives and geodesics are
understood to be with respect to the Riemannian connection of $g$.

Most of the results of this chapter do not apply to pseudo-Riemannian
metrics, at least not without substantial modification. For a treatment of
lengths of curves in the pseudo-Riemannian setting, see [O’N83].


Lengths and Distances on Riemannian Manifolds 

We are now in a position to introduce two of the most fundamental concepts 
from classical geometry into the Riemannian setting: lengths of curves and 
distances between points. We begin with lengths.

Lengths of Curves

If $\gamma\colon [a, b] \to M$ is a curve segment, we define the length of $\gamma$ to be

$$L(\gamma) := \int_a^b |\dot\gamma(t)|\, dt.$$

Sometimes, for the sake of clarity, we emphasize the dependence on the
metric by using the notation $L_g$ instead of $L$.

The key feature of the length of a curve is that it is independent of
parametrization. To make this notion precise, we define a reparametrization
of $\gamma$ to be a curve segment of the form $\tilde\gamma = \gamma \circ \varphi$, where $\varphi\colon [c, d] \to [a, b]$ is a
smooth map with smooth inverse. We say it is a forward reparametrization
if $\varphi$ is orientation preserving, and a backward reparametrization if not.

Lemma 6.1. For any curve segment $\gamma\colon [a, b] \to M$, and any reparametrization
$\tilde\gamma$ of $\gamma$, $L(\tilde\gamma) = L(\gamma)$.

Exercise 6.1. Prove Lemma 6.1. 
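
Lemma 6.1 can be illustrated numerically before it is proved. The sketch below is only an illustration under stated assumptions: it uses the upper half-plane with the metric $(dx^2 + dy^2)/y^2$ (the hyperbolic half-plane with $R = 1$) as the Riemannian manifold, a vertical segment from $(0, 1)$ to $(0, e)$ as the curve, and $\varphi(u) = (u + u^2)/2$ as an orientation-preserving reparametrization of $[0, 1]$. The helper names and the quadrature parameters are hypothetical choices.

```python
import math

def hyperbolic_norm(point, vec):
    """Norm of a tangent vector for the metric (dx^2 + dy^2) / y^2."""
    x, y = point
    return math.hypot(vec[0], vec[1]) / y

def length(curve, a, b, norm=hyperbolic_norm, n=20000, h=1e-6):
    """Midpoint-rule approximation of the integral of |curve'(t)| over [a, b]."""
    dt = (b - a) / n
    total = 0.0
    for i in range(n):
        t = a + (i + 0.5) * dt
        p_plus, p_minus = curve(t + h), curve(t - h)
        # Central-difference approximation of the velocity vector at time t.
        vel = ((p_plus[0] - p_minus[0]) / (2 * h),
               (p_plus[1] - p_minus[1]) / (2 * h))
        total += norm(curve(t), vel) * dt
    return total

def gamma(t):
    """Vertical segment from (0, 1) to (0, e), parametrized linearly on [0, 1]."""
    return (0.0, 1.0 + t * (math.e - 1.0))

def phi(u):
    """Smooth increasing map of [0, 1] onto itself with smooth inverse."""
    return 0.5 * (u + u * u)

def gamma_tilde(u):
    """Forward reparametrization gamma o phi."""
    return gamma(phi(u))

L1 = length(gamma, 0.0, 1.0)
L2 = length(gamma_tilde, 0.0, 1.0)
print(L1, L2)
```

Both values approximate $\int_1^e dy/y = 1$, even though the two parametrizations have different speeds at corresponding points.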

For measuring distances between points, it is useful to modify slightly the
class of curves we consider. A regular curve is a smooth curve $\gamma\colon I \to M$
such that $\dot\gamma(t) \ne 0$ for $t \in I$. Intuitively, this prevents the curve from having
“cusps” or “kinks.” More formally, because the tangent vector $\dot\gamma(t)$ is the
push-forward $\gamma_*(d/dt)$, a regular curve is an immersion of the interval $I$ into
$M$. (If $I$ has one or two endpoints, it has to be considered as a manifold
with boundary.) Note that geodesics are automatically regular, since they
have constant speed.

A continuous map $\gamma\colon [a, b] \to M$ is called a piecewise regular curve segment
if there exists a finite subdivision $a = a_0 < a_1 < \cdots < a_k = b$ such
that $\gamma|_{[a_{i-1}, a_i]}$ is a regular curve for $i = 1, \dots, k$. All distances on a
Riemannian manifold will be measured along such curve segments. For brevity,
we refer to a piecewise regular curve segment as an admissible curve. It’s
also convenient to allow a trivial constant curve $\gamma\colon \{a\} \to M$, $\gamma(a) = p$, to
be considered an admissible curve.

The definition implies that an admissible curve must have well-defined,
nonzero, one-sided velocity vectors when approaching $a_i$ from either side,
but the two limiting velocity vectors need not be equal. We denote these
one-sided velocities by

$$\dot\gamma(a_i^-) := \lim_{t \nearrow a_i} \dot\gamma(t); \qquad \dot\gamma(a_i^+) := \lim_{t \searrow a_i} \dot\gamma(t).$$


Let $\gamma\colon [a, b] \to M$ be an admissible curve, and $a = a_0 < a_1 < \cdots <
a_k = b$ a subdivision as above. The length of $\gamma$ is defined simply as the
sum of the lengths of the smooth subsegments $\gamma|_{[a_{i-1}, a_i]}$. We can broaden
the definition of reparametrization by defining a reparametrization of an
admissible curve $\gamma\colon [a, b] \to M$ to be an admissible curve of the form $\tilde\gamma =
\gamma \circ \varphi$, where $\varphi\colon [c, d] \to [a, b]$ is a homeomorphism whose restriction to
each subinterval $[c_{i-1}, c_i]$ is smooth with smooth inverse, for some finite
subdivision $c = c_0 < c_1 < \cdots < c_k = d$ of $[c, d]$. Then a straightforward
generalization of Lemma 6.1 shows that the length of an admissible curve
is also independent of parametrization.

The arc length function of an admissible curve $\gamma\colon [a, b] \to M$ is the
function $s\colon [a, b] \to \mathbb{R}$ defined by

$$s(t) := L(\gamma|_{[a, t]}) = \int_a^t |\dot\gamma(u)|\, du.$$

It is an immediate consequence of the fundamental theorem of calculus that
$s$ is smooth wherever $\gamma$ is, and $\dot s(t)$ is equal to the speed $|\dot\gamma(t)|$ of $\gamma$.

Among all the possible parametrizations of a given curve, the unit speed 
parametrizations are particularly useful. It is an important fact that every 
admissible curve has such a parametrization, as the next exercise shows. 

Exercise 6.2. Let $\gamma\colon [a, b] \to M$ be an admissible curve, and set $l = L(\gamma)$.

(a) Show that there exists a unique forward reparametrization $\tilde\gamma\colon [0, l] \to
M$ of $\gamma$ such that $\tilde\gamma$ is a unit speed curve.

(b) If $\tilde\gamma$ is any unit speed curve whose parameter interval is of the form
$[0, l]$, show that the arc length function of $\tilde\gamma$ is $s(t) = t$. For this reason,
such a curve is said to be parametrized by arc length.

If $\gamma\colon [a, b] \to M$ is any admissible curve, and $f \in C^\infty[a, b]$, we define the
integral of $f$ with respect to arc length, denoted $\int_\gamma f\, ds$, by

$$\int_\gamma f\, ds := \int_a^b f(t)\, |\dot\gamma(t)|\, dt.$$

Exercise 6.3. Let $\gamma\colon [a, b] \to M$ be an admissible curve, and $f \in C^\infty[a, b]$.

(a) Show that $\int_\gamma f\, ds$ is independent of parametrization.

(b) If $\gamma$ is injective and smooth, show that $C := \gamma[a, b]$ is an embedded
submanifold with boundary in $M$, and

$$\int_\gamma f\, ds = \int_C (f \circ \gamma^{-1})\, dV,$$

where $dV$ is the Riemannian volume element on $C$ associated with the
induced metric and the orientation determined by $\gamma$.

A continuous map $V\colon [a, b] \to TM$ such that $V_t \in T_{\gamma(t)}M$ for all $t$ is
called a piecewise smooth vector field along $\gamma$ if there is a (possibly finer)
finite subdivision $a = \tilde a_0 < \tilde a_1 < \cdots < \tilde a_m = b$ such that $V$ is smooth
on each subinterval $[\tilde a_{i-1}, \tilde a_i]$. Given any vector $V_a \in T_{\gamma(a)}M$, it is easy to
check that $V_a$ has a unique piecewise smooth parallel translate along all
of $\gamma$; simply parallel translate $V_a$ along the first smooth segment to $\gamma(a_1)$,
then parallel translate $V_{a_1}$ along the second smooth segment, and so on.
The parallel translate is smooth wherever $\gamma$ is.

FIGURE 6.1. Any two points can be connected by an admissible curve.

FIGURE 6.2. The triangle inequality.

The Riemannian Distance Function 

Suppose $M$ is a connected Riemannian manifold. For any pair of points
$p, q \in M$, we define the Riemannian distance $d(p, q)$ to be the infimum
of the lengths of all admissible curves from $p$ to $q$. To check that this is
well defined, we need to verify that any two points can be connected by
an admissible curve. Since a connected manifold is path-connected, they
can be connected by a continuous path $c\colon [a, b] \to M$. By compactness,
there is a finite subdivision of $[a, b]$ such that $c[a_{i-1}, a_i]$ is contained in a
single chart for each $i$. Then we may replace each such segment by a smooth
path in coordinates, yielding an admissible curve $\gamma$ between the same points
(Figure 6.1). Therefore $d(p, q)$ is finite for each $p, q \in M$.

Lemma 6.2. With the distance function d defined above, any connected 
Riemannian manifold is a metric space whose induced topology is the same 
as the given manifold topology. 

Proof. It is obvious from the definition that $d(p, q) = d(q, p) \ge 0$ and
$d(p, p) = 0$. The triangle inequality follows from the fact that an admissible
curve from p to q can be combined with one from q to r (possibly changing 
the starting time of the parametrization of the second) to yield one from p 
to r whose length is the sum of the lengths of the two given curves (Figure 
6.2). (This is one reason for defining distance using piecewise regular curves 
instead of just regular ones.) 

It remains to show that $d(p, q) > 0$ when $p \ne q$, and that the metric
topology is the same as the manifold topology. To do so, we need to compare
the Riemannian distance to the Euclidean distance in local coordinates. Let
$p \in M$, and let $(x^i)$ be normal coordinates centered at $p$. Arguing as in the
proof of the uniformly normal neighborhood lemma (Lemma 5.12), there
exists a closed geodesic ball $\bar{\mathcal{Y}}$ of radius $\varepsilon$ around $p$ and positive constants
$c$ and $C$ such that $c|V|_{\bar g} \le |V|_g \le C|V|_{\bar g}$ whenever $V \in T_xM$ and $x \in \bar{\mathcal{Y}}$.
It follows immediately from the definition of length that for any admissible
curve $\gamma$ whose image is contained in $\bar{\mathcal{Y}}$,

$$c\, L_{\bar g}(\gamma) \le L_g(\gamma) \le C\, L_{\bar g}(\gamma). \tag{6.1}$$

Now if $q \ne p$, we may shrink $\varepsilon$ so that $q \notin \bar{\mathcal{Y}}$. Then any admissible
curve $\gamma\colon [a, b] \to M$ from $p$ to $q$ must pass through the geodesic sphere $\partial\mathcal{Y}$
(since the complement of the sphere is disconnected, and $p$, $q$ lie in different
components). If we let $t_0$ denote the first such time (Figure 6.3), it follows
that

$$L_g(\gamma) \ge L_g(\gamma|_{[a, t_0]}) \ge c\, L_{\bar g}(\gamma|_{[a, t_0]}) \ge c\, d_{\bar g}(p, \gamma(t_0)) = c\varepsilon.$$

Taking the infimum over all such $\gamma$, we conclude that $d(p, q) \ge c\varepsilon > 0$.
Thus $d$ is a metric.

Finally, to compare the two topologies, just note that we can construct a
basis for the manifold topology from small Euclidean balls in open sets of
the form $\mathcal{Y}$ as above, and the metric topology is generated by small metric
balls. The discussion above shows that in any such set $\mathcal{Y}$, the Euclidean
distance and the Riemannian distance are equivalent, so the basis open sets
in either topology are open in both. This shows that the two topologies are
the same. □

Geodesics and Minimizing Curves

An admissible curve $\gamma$ in a Riemannian manifold is said to be minimizing
if $L(\gamma) \le L(\tilde\gamma)$ for any other admissible curve $\tilde\gamma$ with the same endpoints.
It follows immediately from the definition of distance that $\gamma$ is minimizing
if and only if $L(\gamma)$ is equal to the distance between its endpoints.

To show that all minimizing curves are geodesics, we will think of the
length function $L$ as a functional on the set of admissible curves in $M$.
(Functions whose domains are themselves sets of functions are usually
called “functionals.”) From this point of view, the search for minimizing
curves can be thought of as searching for minima of this functional.

From calculus, we might expect that a necessary condition for a curve $\gamma$
to be minimizing would be that the “derivative” of $L$ vanish at $\gamma$, in some
sense. This brings us to the brink of the subject known as the calculus of
variations: the use of calculus to identify and analyze extrema of functionals
defined on spaces of functions or maps. In its fully developed state, the
calculus of variations allows one to apply all the usual tools of multivariable
calculus in the infinite-dimensional setting of function spaces, such
as directional derivatives, gradients, critical points, local extrema, saddle
points, and Hessians. For our purposes, however, we do not need to formalize
the theory of calculus in the infinite-dimensional setting. It suffices
to note that if $\gamma$ is a minimizing curve, and $\Gamma_s$ is a family of admissible
curves with the same endpoints such that $L(\Gamma_s)$ is a differentiable function
of $s$ and $\Gamma_0 = \gamma$, then by elementary calculus $L(\Gamma_s)$ must have vanishing
$s$-derivative at $s = 0$ because it attains a minimum there.


Admissible Families 

To make this rigorous, we introduce some more definitions. An admissible
family of curves is a continuous map $\Gamma\colon (-\varepsilon, \varepsilon) \times [a, b] \to M$ that is smooth
on each rectangle of the form $(-\varepsilon, \varepsilon) \times [a_{i-1}, a_i]$ for some finite subdivision
$a = a_0 < \cdots < a_k = b$, and such that $\Gamma_s(t) := \Gamma(s, t)$ is an admissible
curve for each $s \in (-\varepsilon, \varepsilon)$ (Figure 6.4). If $\Gamma$ is an admissible family, a
vector field along $\Gamma$ is a continuous map $V\colon (-\varepsilon, \varepsilon) \times [a, b] \to TM$ such
that $V(s, t) \in T_{\Gamma(s,t)}M$ for each $(s, t)$, and such that $V|_{(-\varepsilon, \varepsilon) \times [\tilde a_{i-1}, \tilde a_i]}$ is
smooth for some (possibly finer) subdivision $a = \tilde a_0 < \cdots < \tilde a_m = b$.

Any admissible family $\Gamma$ defines two collections of curves: the main curves
$\Gamma_s(t) = \Gamma(s, t)$ defined on $[a, b]$ by setting $s = \text{constant}$, and the transverse
curves $\Gamma^{(t)}(s) = \Gamma(s, t)$ defined on $(-\varepsilon, \varepsilon)$ by setting $t = \text{constant}$. The
transverse curves are smooth on $(-\varepsilon, \varepsilon)$ for each $t$, while the main curves
are in general only piecewise regular. Wherever $\Gamma$ is smooth, the tangent
vectors to these two families of curves are examples of vector fields along
$\Gamma$; we denote them by

$$\partial_t\Gamma(s, t) := \frac{d}{dt}\Gamma_s(t); \qquad \partial_s\Gamma(s, t) := \frac{d}{ds}\Gamma^{(t)}(s).$$

FIGURE 6.4. An admissible family.

In fact, $\partial_s\Gamma$ is always continuous on the whole rectangle $(-\varepsilon, \varepsilon) \times [a, b]$: on
one hand, its value along the line segment $(-\varepsilon, \varepsilon) \times \{a_i\}$ depends only on the
values of $\Gamma$ on that segment, since the derivative is taken only with respect
to the $s$ variable; on the other hand, it is continuous (in fact smooth) on
each subrectangle $(-\varepsilon, \varepsilon) \times [a_{i-1}, a_i]$ and $(-\varepsilon, \varepsilon) \times [a_i, a_{i+1}]$, so the
right-handed and left-handed limits at $t = a_i$ must be equal. Therefore $\partial_s\Gamma$ is
always a vector field along $\Gamma$. (However, $\partial_t\Gamma$ is not usually continuous at
$t = a_i$.)

If $V$ is a vector field along $\Gamma$, we can compute the covariant derivative
of $V$ either along the main curves or along the transverse curves, at least
where the former are smooth; the resulting vector fields along $\Gamma$ are denoted
$D_tV$ and $D_sV$ respectively.

As mentioned earlier, a key ingredient in the proof that minimizing curves 
are geodesics is the symmetry of the Riemannian connection. It enters into 
our proofs in the form of the following lemma. (Although we state and use 
this lemma only for the Riemannian connection, the proof shows that it is 
actually true for any symmetric connection.) 

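Lemma 6.3. (Symmetry Lemma) Let $\Gamma\colon (-\varepsilon, \varepsilon) \times [a, b] \to M$ be an
admissible family of curves in a Riemannian (or pseudo-Riemannian) manifold.
On any rectangle $(-\varepsilon, \varepsilon) \times [a_{i-1}, a_i]$ where $\Gamma$ is smooth,

$$D_s \partial_t\Gamma = D_t \partial_s\Gamma.$$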


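Proof. This is a local question, so we may compute in coordinates $(x^i)$
around any point $\Gamma(s_0, t_0)$. Writing the components of $\Gamma$ as $\Gamma(s, t) =
(x^1(s, t), \dots, x^n(s, t))$, we have

$$\partial_t\Gamma = \frac{\partial x^k}{\partial t}\,\partial_k; \qquad \partial_s\Gamma = \frac{\partial x^k}{\partial s}\,\partial_k.$$

Then, using the coordinate formula (4.10) for covariant derivatives along
curves,

$$D_s \partial_t\Gamma = \left( \frac{\partial^2 x^k}{\partial s\, \partial t} + \frac{\partial x^i}{\partial t} \frac{\partial x^j}{\partial s}\, \Gamma^k_{ji} \right) \partial_k;$$

$$D_t \partial_s\Gamma = \left( \frac{\partial^2 x^k}{\partial t\, \partial s} + \frac{\partial x^i}{\partial s} \frac{\partial x^j}{\partial t}\, \Gamma^k_{ji} \right) \partial_k.$$

Reversing the roles of $i$ and $j$ in the second line above, and using the
symmetry condition $\Gamma^k_{ji} = \Gamma^k_{ij}$, we see immediately that these two expressions
are equal. □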


If $\gamma\colon [a, b] \to M$ is an admissible curve, a variation of $\gamma$ is an admissible
family $\Gamma$ such that $\Gamma_0(t) = \gamma(t)$ for all $t \in [a, b]$. It is called a
proper variation or fixed-endpoint variation if in addition $\Gamma_s(a) = \gamma(a)$
and $\Gamma_s(b) = \gamma(b)$ for all $s$. If $\Gamma$ is a variation of $\gamma$, the variation field of $\Gamma$ is
the vector field $V(t) = \partial_s\Gamma(0, t)$ along $\gamma$. A vector field $V$ along $\gamma$ is proper
if $V(a) = V(b) = 0$. It is clear that the variation field of a proper variation
is itself proper.

Lemma 6.4. If $\gamma$ is an admissible curve and $V$ is a vector field along $\gamma$,
then $V$ is the variation field of some variation of $\gamma$. If $V$ is proper, the
variation can be taken to be proper as well.

Proof. Set $\Gamma(s, t) = \exp(sV(t))$ (Figure 6.5). By compactness of $[a, b]$, there
is some positive $\varepsilon$ such that $\Gamma$ is defined on $(-\varepsilon, \varepsilon) \times [a, b]$. Clearly $\Gamma$ is
smooth on $(-\varepsilon, \varepsilon) \times [\tilde a_{i-1}, \tilde a_i]$ for each subinterval $[\tilde a_{i-1}, \tilde a_i]$ on which $V$ is
smooth, and is continuous on its whole domain. By the properties of the
exponential map, the variation field of $\Gamma$ is $V$. Moreover, if $V(a) = V(b) =
0$, it is immediate that $\Gamma(s, a) = \gamma(a)$ and $\Gamma(s, b) = \gamma(b)$, so $\Gamma$ is proper. □

FIGURE 6.6. $\Delta_i\dot\gamma$ is the “jump” in $\dot\gamma$ at $a_i$.


Minimizing Curves Are Geodesics 

We can now compute an expression for the derivative of the length func¬ 
tional along a proper variation. Traditionally, the derivative of a functional 
on a space of maps is called its first variation. 

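Proposition 6.5. (First Variation Formula) Let $\gamma\colon [a, b] \to M$ be any
unit speed admissible curve, $\Gamma$ a proper variation of $\gamma$, and $V$ its variation
field. Then

$$\left.\frac{d}{ds}\right|_{s=0} L(\Gamma_s) = -\int_a^b \langle V, D_t\dot\gamma \rangle\, dt - \sum_{i=1}^{k-1} \langle V(a_i), \Delta_i\dot\gamma \rangle, \tag{6.2}$$

where $\Delta_i\dot\gamma = \dot\gamma(a_i^+) - \dot\gamma(a_i^-)$ is the “jump” in the tangent vector field $\dot\gamma$ at
$a_i$ (Figure 6.6).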

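Proof. For brevity, denote

$$T(s, t) = \partial_t\Gamma(s, t), \qquad S(s, t) = \partial_s\Gamma(s, t).$$

On any subinterval $[a_{i-1}, a_i]$ where $\Gamma$ is smooth, since the integrand in
$L(\Gamma_s)$ is smooth and the domain of integration is compact, we can
differentiate under the integral sign to obtain

$$\frac{d}{ds} L(\Gamma_s|_{[a_{i-1}, a_i]}) = \int_{a_{i-1}}^{a_i} \frac{\partial}{\partial s} \langle T, T \rangle^{1/2}\, dt = \int_{a_{i-1}}^{a_i} \frac{1}{2} \langle T, T \rangle^{-1/2}\, 2 \langle D_sT, T \rangle\, dt = \int_{a_{i-1}}^{a_i} \frac{1}{|T|} \langle D_tS, T \rangle\, dt, \tag{6.3}$$

where we have used the symmetry lemma in the last line. Setting $s = 0$
and noting that $S(0, t) = V(t)$ and $T(0, t) = \dot\gamma(t)$ (which has length 1),

$$\left.\frac{d}{ds}\right|_{s=0} L(\Gamma_s|_{[a_{i-1}, a_i]}) = \int_{a_{i-1}}^{a_i} \langle D_tV, \dot\gamma \rangle\, dt = \int_{a_{i-1}}^{a_i} \left( \frac{d}{dt} \langle V, \dot\gamma \rangle - \langle V, D_t\dot\gamma \rangle \right) dt$$

$$= \langle V(a_i), \dot\gamma(a_i^-) \rangle - \langle V(a_{i-1}), \dot\gamma(a_{i-1}^+) \rangle - \int_{a_{i-1}}^{a_i} \langle V, D_t\dot\gamma \rangle\, dt.$$

Finally, summing over $i$ and noting $V(a_0) = V(a_k) = 0$ because $\Gamma$ is a
proper variation, we obtain (6.2). □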

Because any admissible curve has a unit speed parametrization and 
length is independent of parametrization, the requirement in the above 
proposition that $\gamma$ be unit speed is not a real restriction, but rather just a
computational convenience. 

Exercise 6.4. Let $\gamma$ be a smooth, unit speed curve.

(a) Show that $D_t\dot\gamma(t)$ is orthogonal to $\dot\gamma(t)$ for all $t$.

(b) If $\Gamma$ is a proper variation of $\gamma$ such that for all $s$, $\Gamma_s$ is a
reparametrization of $\gamma$, show that the first variation of $L(\Gamma_s)$ vanishes.


Theorem 6.6. Every minimizing curve is a geodesic when it is given a 
unit speed parametrization. 

Proof. Suppose $\gamma\colon [a,b] \to M$ is minimizing and unit speed, and let
$a = a_0 < \cdots < a_k = b$ be a subdivision such that $\gamma$ is smooth on $[a_{i-1},a_i]$. If
$\Gamma$ is any proper variation of $\gamma$, we conclude from elementary calculus that
$dL(\Gamma_s)/ds = 0$ when $s = 0$. Since every proper vector field along $\gamma$ is the
variation field of some proper variation, the right-hand side of (6.2) must
vanish for every such $V$.

The first step is to show that $D_t\dot\gamma = 0$ on each subinterval $[a_{i-1},a_i]$, so
$\gamma$ is a “broken geodesic.” Choose one such interval, and let $\varphi \in C^\infty(\mathbf{R})$ be
a bump function such that $\varphi > 0$ on $(a_{i-1},a_i)$ and $\varphi = 0$ elsewhere. Then
(6.2) with $V = \varphi D_t\dot\gamma$ becomes

\[
0 = -\int_{a_{i-1}}^{a_i} \varphi\,|D_t\dot\gamma|^2\,dt.
\]

Since the integrand is nonnegative, this shows that $D_t\dot\gamma = 0$ on each such
subinterval.

Next we need to show that $\Delta_i\dot\gamma = 0$, which is to say that $\gamma$ has no corners.
For any $i$ between 0 and $k$, it is easy to use a bump function in a coordinate
chart to construct a vector field $V$ along $\gamma$ such that $V(a_i) = \Delta_i\dot\gamma$ and
$V(a_j) = 0$ for $j \ne i$. Then (6.2) reduces to $-|\Delta_i\dot\gamma|^2 = 0$.

Finally, since the two one-sided velocity vectors of $\gamma$ match up at each $a_i$,
it follows from uniqueness of geodesics that $\gamma|_{[a_i,a_{i+1}]}$ is the continuation
of the geodesic $\gamma|_{[a_{i-1},a_i]}$, and therefore $\gamma$ is smooth. □

FIGURE 6.7. Deforming $\gamma$ in the direction of its acceleration vector.

FIGURE 6.8. Rounding the corner.

The preceding proof has an enlightening geometric interpretation.
Assuming $D_t\dot\gamma \ne 0$, the first variation with $V = \varphi D_t\dot\gamma$ is negative, which
shows that deforming $\gamma$ in the direction of its acceleration vector decreases
its length (Figure 6.7). Similarly, the length of a broken geodesic $\gamma$ is
decreased by deforming it in the direction of a vector field $V$ such that
$V(a_i) = \Delta_i\dot\gamma$ (Figure 6.8). Geometrically, this corresponds to “rounding
the corner.”
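
The corner-rounding statement can be checked numerically in the simplest
possible case. The following Python sketch is an illustration under stated
assumptions, not part of the original argument: it takes $M$ to be the
Euclidean plane, uses a hypothetical unit speed broken geodesic running from
$(-1,0)$ to $(0,0)$ to $(0,1)$, and deforms it by the piecewise linear field
$V(t) = (1-|t-1|)\,\Delta\dot\gamma$, where $\Delta\dot\gamma = (-1,1)$ is the jump at the corner. Since
$D_t\dot\gamma = 0$ on each straight segment, formula (6.2) predicts that the derivative
of the length at $s = 0$ equals $-|\Delta\dot\gamma|^2 = -2$.

```python
import math

# Hypothetical example data: a broken geodesic in the Euclidean plane with a
# single corner at t = 1, parametrized by arc length on [0, 2].
def gamma(t):
    return (t - 1.0, 0.0) if t <= 1.0 else (0.0, t - 1.0)

# Jump in the velocity at the corner: (0, 1) after minus (1, 0) before.
JUMP = (-1.0, 1.0)

def variation(s, t):
    """Gamma(s, t) = gamma(t) + s * V(t), with V(t) = (1 - |t - 1|) * JUMP."""
    phi = 1.0 - abs(t - 1.0)  # tent function: 0 at t = 0 and t = 2, 1 at t = 1
    x, y = gamma(t)
    return (x + s * phi * JUMP[0], y + s * phi * JUMP[1])

def length(s, n=2000):
    """Length of the curve t -> Gamma(s, t) on [0, 2], summed over n chords."""
    pts = [variation(s, 2.0 * i / n) for i in range(n + 1)]
    return sum(math.dist(p, q) for p, q in zip(pts, pts[1:]))

def first_variation(h=1e-5):
    """Central difference approximation of dL/ds at s = 0."""
    return (length(h) - length(-h)) / (2.0 * h)

if __name__ == "__main__":
    print(first_variation())        # expected to be close to -2
    print(length(0.0), length(0.1)) # the deformed curve is shorter
```

Because the variation field is proper and piecewise linear, each curve in
this family is itself a two-segment polygon, so the chord sum computes its
length exactly up to rounding.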

The first variation formula actually tells us a bit more than is claimed in
Theorem 6.6. In proving that $\gamma$ is a geodesic, we didn’t use the full strength
of the assumption that it is a minimizing curve; we used only the fact that
it is a critical point of $L$, which means that for any proper variation $\Gamma_s$ of
$\gamma$, the derivative of $L(\Gamma_s)$ with respect to $s$ is zero at $s = 0$. Therefore we
can strengthen Theorem 6.6 in the following way.

Corollary 6.7. A unit speed admissible curve $\gamma$ is a critical point for $L$ if
and only if it is a geodesic.

Proof. If $\gamma$ is a critical point, the proof of Theorem 6.6 goes through without
modification to show that $\gamma$ is a geodesic. Conversely, if $\gamma$ is a geodesic,
then the first term on the right-hand side of the first variation formula (6.2)
vanishes by the geodesic equation, and the second term vanishes because $\dot\gamma$
has no jumps. □

The geodesic equation $D_t\dot\gamma = 0$ thus characterizes the critical points
of the length functional. In general, the equation that characterizes critical
points of a functional on a space of maps is called the variational equation or
the Euler-Lagrange equation of the functional. Many interesting equations
in differential geometry arise as variational equations. We touch briefly on
three others in this book: the Einstein equation (Chapter 7), the Yamabe
equation (Chapter 7), and the minimal surface equation (Chapter 8).

FIGURE 6.9. Proof of the Gauss lemma.


Geodesics Are Locally Minimizing 

Next we turn to the converse of Theorem 6.6, and show that geodesics are 
locally minimizing. The proof is based on the following deceptively simple 
geometric fact. 

Theorem 6.8. (The Gauss Lemma) Let $\mathcal{U}$ be a geodesic ball centered
at $p \in M$. The unit radial vector field $\partial/\partial r$ is $g$-orthogonal to the geodesic
spheres in $\mathcal{U}$.

Proof. Let $q \in \mathcal{U}$ and let $X \in T_qM$ be a vector tangent to the geodesic
sphere through $q$. Because $\exp_p$ is a diffeomorphism onto $\mathcal{U}$, there is a vector
$V \in T_pM$ such that $q = \exp_p V$, and there is a vector $W \in T_V(T_pM) = T_pM$
such that $X = (\exp_p)_*W$ (Figure 6.9). Then $V \in \partial B_R(0)$ and
$W \in T_V\partial B_R(0)$, where $R = d(p,q)$. The radial geodesic from $p$ to $q$ is
$\gamma(t) = \exp_p(tV)$, with tangent vector $\dot\gamma(t) = R\,\partial/\partial r$. Thus we need to show that
$X \perp \dot\gamma(1)$ with respect to $g$.

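Choose a curve $\sigma\colon (-\varepsilon,\varepsilon) \to T_pM$ lying in $\partial B_R(0)$ such that $\sigma(0) = V$
and $\sigma'(0) = W$, and consider the variation $\Gamma$ of $\gamma$ (Figure 6.10) given by

\[
\Gamma(s,t) = \exp_p(t\sigma(s)).
\]

FIGURE 6.10. The variation $\Gamma$.

For each $s \in (-\varepsilon,\varepsilon)$, $\sigma(s)$ is a vector of length $R$, so $\Gamma_s$ is a geodesic with
constant speed $R$. As before, let $S = \partial_s\Gamma$ and $T = \partial_t\Gamma$. It follows from the
definitions that

\[
\begin{aligned}
S(0,0) &= \frac{d}{ds}\bigg|_{s=0} \exp_p(0) = 0; \\
T(0,0) &= \frac{d}{dt}\bigg|_{t=0} \exp_p(tV) = V; \\
S(0,1) &= \frac{d}{ds}\bigg|_{s=0} \exp_p(\sigma(s)) = (\exp_p)_*\sigma'(0) = X; \\
T(0,1) &= \frac{d}{dt}\bigg|_{t=1} \exp_p(tV) = \dot\gamma(1).
\end{aligned}
\]

Therefore $\langle S,T\rangle$ is zero when $(s,t) = (0,0)$ and equal to $\langle X,\dot\gamma(1)\rangle$ when
$(s,t) = (0,1)$, so to prove the theorem it suffices to show $\langle S,T\rangle$ is
independent of $t$.

We compute

\[
\begin{aligned}
\frac{\partial}{\partial t}\langle S,T\rangle
  &= \langle D_tS,T\rangle + \langle S,D_tT\rangle \\
  &= \langle D_sT,T\rangle + 0 \\
  &= \frac{1}{2}\frac{\partial}{\partial s}|T|^2 = 0,
\end{aligned}
\]

where we have used (1) the symmetry lemma $D_tS = D_sT$, (2) the fact that
$D_tT = 0$ since each $\Gamma_s$ is a geodesic, and (3) the fact that $|T| = |\dot\Gamma_s| \equiv R$
for all $(s,t)$. This proves the theorem. □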
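
As a sanity check, the Gauss lemma can be tested numerically on a concrete
manifold. The sketch below is only an illustration under assumptions that are
not in the text: it takes $M$ to be the unit sphere in $\mathbf{R}^3$ with the metric
induced by the Euclidean dot product, takes $p$ to be the north pole, and uses
the standard closed form $\exp_p(v) = \cos|v|\,p + \sin|v|\,v/|v|$ for the exponential
map. The helper names (`exp_north`, `diff`, `gauss_lemma_defect`) are
hypothetical. It approximates $S(0,1)$ and $T(0,1)$ by central differences and
returns their inner product, which the lemma says should vanish.

```python
import math

def exp_north(v):
    """exp_p(v) on the unit sphere, p = (0, 0, 1), v = (v1, v2) in T_pM."""
    r = math.hypot(v[0], v[1])
    if r == 0.0:
        return (0.0, 0.0, 1.0)
    k = math.sin(r) / r
    return (k * v[0], k * v[1], math.cos(r))

def diff(f, h=1e-5):
    """Central difference approximation of f'(0) for a curve f in R^3."""
    a, b = f(h), f(-h)
    return tuple((x - y) / (2.0 * h) for x, y in zip(a, b))

def gauss_lemma_defect(R=1.2, angle=0.7):
    """Inner product of S(0,1) and T(0,1) for the variation exp_p(t*sigma(s))."""
    V = (R * math.cos(angle), R * math.sin(angle))
    # sigma(s) stays on the sphere of radius R in T_pM, with sigma(0) = V.
    def sigma(s):
        return (R * math.cos(angle + s), R * math.sin(angle + s))
    # S(0,1): derivative in s of exp_p(sigma(s)) at s = 0 (tangent to the geodesic sphere).
    S = diff(lambda s: exp_north(sigma(s)))
    # T(0,1): derivative in t of exp_p(t*V) at t = 1 (radial velocity).
    T = diff(lambda u: exp_north(((1.0 + u) * V[0], (1.0 + u) * V[1])))
    return sum(x * y for x, y in zip(S, T))

if __name__ == "__main__":
    print(gauss_lemma_defect())  # expected to be numerically close to 0
```

The radius `R = 1.2` is chosen smaller than $\pi$ so that the point lies in a
geodesic ball around the pole; other values below $\pi$ should behave the same way.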


We will use the Gauss lemma primarily in the form of the next corollary. 

Corollary 6.9. Let $(x^i)$ be normal coordinates on a geodesic ball $\mathcal{U}$
centered at $p \in M$, and let $r$ be the radial distance function as defined in (5.9).
Then $\operatorname{grad} r = \partial/\partial r$ on $\mathcal{U} - \{p\}$.

FIGURE 6.11. Decomposition of $Y$ into tangential and normal components.


Proof. For any $q \in \mathcal{U} - \{p\}$ and $Y \in T_qM$, we need to show that

\[
dr(Y) = \Bigl\langle \frac{\partial}{\partial r}, Y \Bigr\rangle. \tag{6.4}
\]

The geodesic sphere $\exp_p(\partial B_R(0))$ through $q$ is characterized in normal
coordinates by the equation $r = R$. Since $\partial/\partial r$ is transverse to this sphere,
we can decompose $Y$ as $\alpha\,\partial/\partial r + X$ for some constant $\alpha$ and some vector $X$
tangent to the sphere (Figure 6.11). Observe that $dr(\partial/\partial r) = 1$ by direct
computation in coordinates, and $dr(X) = 0$ since $X$ is tangent to a level
set of $r$. (This has nothing to do with the metric!) Therefore the left-hand
side of (6.4) is

\[
dr\Bigl(\alpha\frac{\partial}{\partial r} + X\Bigr) = \alpha\,dr\Bigl(\frac{\partial}{\partial r}\Bigr) + dr(X) = \alpha.
\]

On the other hand, by Proposition 5.11(e), $\partial/\partial r$ is a unit vector.
Therefore, the right-hand side of (6.4) is

\[
\Bigl\langle \frac{\partial}{\partial r}, \alpha\frac{\partial}{\partial r} + X \Bigr\rangle
  = \alpha\,\Bigl|\frac{\partial}{\partial r}\Bigr|^2 + \Bigl\langle \frac{\partial}{\partial r}, X \Bigr\rangle = \alpha,
\]

where we have used the Gauss lemma to conclude that $X$ is orthogonal to
$\partial/\partial r$. □

Proposition 6.10. Suppose $p \in M$ and $q$ is contained in a geodesic ball
around $p$. Then (up to reparametrization) the radial geodesic from $p$ to $q$
is the unique minimizing curve from $p$ to $q$ in $M$.


Proof. Choose $\varepsilon > 0$ such that $\exp_p(B_\varepsilon(0))$ is a geodesic ball containing
$q$. Let $\gamma\colon [0,R] \to M$ be the radial geodesic from $p$ to $q$ parametrized by
arc length, and write $\gamma(t) = \exp_p(tV)$ for some unit vector $V \in T_pM$.
Then $L(\gamma) = R$ since $\gamma$ has unit speed, so we need to show that any
other admissible curve from $p$ to $q$ has length strictly greater than $R$. Let
$S_R = \exp_p(\partial B_R(0))$ denote the geodesic sphere of radius $R$.

Let $\sigma\colon [0,b] \to M$ be such a curve, which we may assume to be
parametrized by arc length as well. We begin by showing $L(\sigma) \ge L(\gamma)$.

Let $a_0 \in [0,b]$ denote the last time that $\sigma(t) = p$ and $b_0 \in [0,b]$ the first
time after $a_0$ that $\sigma(t) \in S_R$ (Figure 6.12). For any $t \in (a_0,b_0]$, we can
decompose $\dot\sigma(t)$ as

\[
\dot\sigma(t) = \alpha(t)\frac{\partial}{\partial r} + X(t),
\]

where $X(t)$ is tangent to the geodesic sphere through $\sigma(t)$. By the Gauss
lemma, this is an orthogonal decomposition, so $|\dot\sigma(t)|^2 = \alpha(t)^2 + |X(t)|^2 \ge \alpha(t)^2$.
Moreover, by Corollary 6.9, $\alpha(t) = \langle \partial/\partial r, \dot\sigma(t)\rangle = dr(\dot\sigma(t))$. Therefore

\[
\begin{aligned}
L(\sigma) \ge L\bigl(\sigma|_{[a_0,b_0]}\bigr)
  &= \lim_{\delta\to 0} \int_{a_0+\delta}^{b_0} |\dot\sigma(t)|\,dt \\
  &\ge \lim_{\delta\to 0} \int_{a_0+\delta}^{b_0} \alpha(t)\,dt \\
  &= \lim_{\delta\to 0} \int_{a_0+\delta}^{b_0} dr(\dot\sigma(t))\,dt \\
  &= \lim_{\delta\to 0} \int_{a_0+\delta}^{b_0} \frac{d}{dt}\,r(\sigma(t))\,dt \\
  &= r(\sigma(b_0)) - r(\sigma(a_0)) = R = L(\gamma).
\end{aligned} \tag{6.5}
\]

Thus $\gamma$ is minimizing.

Now suppose $L(\sigma) = R$. Then both inequalities in (6.5) are equalities.
Because we assume $\sigma$ is a unit speed curve, the first equality implies that
$a_0 = 0$ and $b_0 = b = R$, since otherwise the segments of $\sigma$ before $t = a_0$ and
after $t = b_0$ would contribute positive lengths. The second equality implies
that $X(t) \equiv 0$ and $\alpha(t) > 0$, so $\dot\sigma(t)$ is a positive multiple of $\partial/\partial r$. For $\sigma$ to
have unit speed we must have $\dot\sigma(t) = \partial/\partial r$. Thus $\sigma$ and $\gamma$ are both integral
curves of $\partial/\partial r$ passing through $q$ at time $t = R$, so $\sigma = \gamma$. □

Corollary 6.11. Within any geodesic ball around $p \in M$, the radial
distance function $r(x)$ defined by (5.9) is equal to the Riemannian distance
from $p$ to $x$.


Proof. The radial geodesic $\gamma$ from $p$ to $x$ is minimizing by Proposition 6.10.
Since its velocity is equal to $\partial/\partial r$, which is a unit vector in both the $g$-norm
and the Euclidean norm in normal coordinates, the $g$-length of $\gamma$ is equal
to its Euclidean length, which is $r(x)$. □
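
The restriction to a geodesic ball in this corollary matters, as a small
numerical experiment suggests. The sketch below is an illustration under
assumptions not made in the text: $M$ is the unit sphere in $\mathbf{R}^3$, $p$ is the north
pole, the exponential map is given by the standard closed form
$\exp_p(v) = \cos|v|\,p + \sin|v|\,v/|v|$, and the Riemannian distance from $p$ is the
great-circle distance $\arccos(p\cdot q)$. The function names are hypothetical.

```python
import math

def exp_north(v):
    """exp_p(v) on the unit sphere, p = (0, 0, 1), v = (v1, v2) in T_pM."""
    r = math.hypot(v[0], v[1])
    if r == 0.0:
        return (0.0, 0.0, 1.0)
    k = math.sin(r) / r
    return (k * v[0], k * v[1], math.cos(r))

def distance_from_north(q):
    """Great-circle distance on the unit sphere from p = (0, 0, 1) to q."""
    # Clamp to [-1, 1] to guard against rounding before calling acos.
    return math.acos(max(-1.0, min(1.0, q[2])))

def radial_vs_metric(v):
    """Return (|v|, d(p, exp_p(v))) so the two can be compared."""
    return math.hypot(v[0], v[1]), distance_from_north(exp_north(v))

if __name__ == "__main__":
    print(radial_vs_metric((1.2, 1.6)))  # |v| = 2 < pi: the two values agree
    print(radial_vs_metric((2.4, 3.2)))  # |v| = 4 > pi: the distance is smaller
```

For $|v| < \pi$ the point lies in a geodesic ball around the pole and the two
numbers coincide, as the corollary asserts; for $|v| = 4$ the radial geodesic
has passed the antipodal point and the distance drops to $2\pi - 4$.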

This corollary suggests a simplified notation for geodesic balls and
spheres in $M$. If $\mathcal{U} = \exp_p(B_R(0))$ is a geodesic ball around $p$, Corollary
6.11 shows that $\mathcal{U}$ is equal to the metric ball of radius $R$ around $p$. Similarly,
a geodesic sphere of radius $R$ is the set of points whose distance from $p$ is
exactly $R$. From now on, we will use the notations $B_R(p) = \exp_p(B_R(0))$,
$\bar B_R(p) = \exp_p(\bar B_R(0))$, and $S_R(p) = \exp_p(\partial B_R(0))$ for open and closed
geodesic balls and geodesic spheres, which are exactly those metric balls
and spheres that lie within a normal neighborhood of $p$.

We say a curve $\gamma\colon I \to M$ is locally minimizing if any $t_0 \in I$ has a
neighborhood $\mathcal{U} \subset I$ such that $\gamma|_{\mathcal{U}}$ is minimizing between each pair of its
points. Note that a minimizing curve is automatically locally minimizing,
because it is minimizing between any two of its points.

FIGURE 6.13. Geodesics are locally minimizing.


Theorem 6.12. Every Riemannian geodesic is locally minimizing. 


Proof. Let $\gamma\colon I \to M$ be a geodesic, which we may assume to be defined on
an open interval, and let $t_0 \in I$. Let $\mathcal{W}$ be a uniformly normal neighborhood
of $\gamma(t_0)$, and let $\mathcal{U} \subset I$ be the connected component of $\gamma^{-1}(\mathcal{W})$ containing
$t_0$. If $t_1,t_2 \in \mathcal{U}$ and $q_i = \gamma(t_i)$, the definition of uniformly normal
neighborhood implies that $q_2$ is contained in a geodesic ball around $q_1$ (Figure
6.13). Therefore, by Proposition 6.10, the radial geodesic from $q_1$ to $q_2$ is
the unique minimizing curve between them. However, the restriction of $\gamma$
is a geodesic from $q_1$ to $q_2$ lying in the same geodesic ball, and thus $\gamma$ must
itself be this minimizing geodesic. □


It is interesting to note that the Gauss lemma and its corollary also yield 
another proof that minimizing curves are geodesics, without using the first 
variation formula. On the principle that knowing more than one proof of 
an important fact always deepens our understanding of it, we present this 
proof for good measure. 


Another proof of Theorem 6.6. Suppose $\gamma\colon [a,b] \to M$ is any minimizing
curve segment. Just as in the preceding proof, for any $t_0 \in [a,b]$ we can
find a connected neighborhood $\mathcal{U}$ of $t_0$ such that $\gamma(\mathcal{U})$ is contained in a
uniformly normal neighborhood $\mathcal{W}$. Then for any $t_1,t_2 \in \mathcal{U}$, the same
argument as above shows that the unique minimizing curve from $\gamma(t_1)$ to
$\gamma(t_2)$ is the radial geodesic joining them. Since the restriction of $\gamma$ is such
a minimizing curve, it must coincide with this radial geodesic. Therefore $\gamma$
solves the geodesic equation in a neighborhood of $t_0$. Since $t_0$ was arbitrary,
$\gamma$ is a geodesic. □

FIGURE 6.14. $\gamma$ extends past $q$.


Completeness 

A Riemannian manifold is said to be geodesically complete if every maximal
geodesic is defined for all $t \in \mathbf{R}$. It is easy to construct examples of
manifolds that are not geodesically complete; for example, in any proper open
subset of $\mathbf{R}^n$ with its Euclidean metric, there are geodesics that reach the
boundary in finite time. Similarly, on $\mathbf{R}^n$ with the metric obtained
from the sphere by stereographic projection, there are geodesics that escape
to infinity in finite time. The following theorem provides a simple criterion
for determining when a Riemannian manifold is geodesically complete.

Theorem 6.13. (Hopf-Rinow) A connected Riemannian manifold is
geodesically complete if and only if it is complete as a metric space. 

Proof. Suppose first that $M$ is complete as a metric space but not
geodesically complete. Then there is some unit speed geodesic $\gamma\colon [0,b) \to M$
that extends to no interval $[0,b+\varepsilon)$ for $\varepsilon > 0$. Let $\{t_i\}$ be any increasing
sequence that approaches $b$, and set $q_i = \gamma(t_i)$. Since $\gamma$ is parametrized by
arc length, the length of $\gamma|_{[t_i,t_j]}$ is exactly $|t_j - t_i|$, so $d(q_i,q_j) \le |t_j - t_i|$
and $\{q_i\}$ is a Cauchy sequence in $M$. By completeness, $\{q_i\}$ converges to
some point $q \in M$.

Let $\mathcal{W}$ be a uniformly normal neighborhood of $q$, and let $\delta > 0$ be chosen
so that $\mathcal{W}$ is contained in a geodesic $\delta$-ball around each of its points. For
all large $j$, $q_j \in \mathcal{W}$ (Figure 6.14), and by taking $j$ large enough, we may
assume $t_j > b - \delta$. The fact that $B_\delta(q_j)$ is a geodesic ball means that every
geodesic starting at $q_j$ exists at least for time $\delta$. In particular, this is true
of the geodesic $\sigma$ with $\sigma(0) = q_j$ and $\dot\sigma(0) = \dot\gamma(t_j)$. But by uniqueness of
geodesics, this must be simply a reparametrization of $\gamma$, namely
$\sigma(t) = \gamma(t_j + t)$, so $\tilde\gamma(t) = \sigma(t - t_j)$ is an extension of $\gamma$ past $b$,
which is a contradiction.

FIGURE 6.15. Proof that $\gamma|_{[0,\varepsilon]}$ aims at $q$.


To prove the converse, we will actually prove something stronger: If there
is one point $p \in M$ such that $\exp_p$ is defined on the whole tangent space
$T_pM$, then $M$ is a complete metric space.

Suppose $p$ is such a point. We show first that given any other point
$q \in M$, there is a minimizing geodesic segment from $p$ to $q$. If $\gamma\colon [0,b] \to M$
is a geodesic segment, we say that $\gamma$ aims at $q$ if $\gamma$ is minimizing and

\[
d(\gamma(0),q) = d(\gamma(0),\gamma(b)) + d(\gamma(b),q). \tag{6.6}
\]

(This, of course, would be the case if $\gamma$ were an initial segment of a
minimizing geodesic from $\gamma(0)$ to $q$.) It will suffice to show that there is a geodesic
segment $\gamma$ that begins at $p$, aims at $q$, and has length equal to $d(p,q)$, for
then (6.6) says that

\[
d(p,q) = d(p,q) + d(\gamma(b),q),
\]

which implies $\gamma(b) = q$. Since $\gamma$ is assumed to be minimizing, it is the
desired geodesic segment.

Choose $\varepsilon > 0$ such that $\bar B_\varepsilon(p)$ is a closed geodesic ball around $p$. If
$q \in \bar B_\varepsilon(p)$, there is a minimizing geodesic from $p$ to $q$ by Proposition 6.10,
and we have nothing more to prove. If $q \notin \bar B_\varepsilon(p)$, since the distance function
on any metric space is continuous there is a point $x \in S_\varepsilon(p)$ where $d(x,q)$
attains its minimum on the compact set $S_\varepsilon(p)$. Let $\gamma$ be the unit speed
radial geodesic from $p$ to $x$ (Figure 6.15); by assumption, $\gamma$ is defined for
all time.

We begin by showing that $\gamma|_{[0,\varepsilon]}$ aims at $q$. Since it is minimizing by
Proposition 6.10, we need only show that (6.6) holds with $b = \varepsilon$, or
$d(p,q) = d(p,x) + d(x,q)$. By the triangle inequality, the only way for this to fail is
if $d(p,q) < d(p,x) + d(x,q)$. Then there is a unit speed admissible curve $\sigma$
from $p$ to $q$ whose length is strictly less than $d(p,x) + d(x,q)$. Let $\sigma_1$ denote
the portion of $\sigma$ inside $\bar B_\varepsilon(p)$, and $\sigma_2$ the rest (Figure 6.15). Then, since
$L(\sigma_1) \ge \varepsilon$,

\[
\begin{aligned}
d(p,x) + d(x,q) &> L(\sigma) \\
  &\ge \varepsilon + L(\sigma_2) \\
  &= d(p,x) + L(\sigma_2).
\end{aligned}
\]

But this means $L(\sigma_2) < d(x,q)$, which contradicts our choice of $x$.

Let $T = d(p,q)$ and

\[
\mathcal{S} = \{\, b \in [0,T] : \gamma|_{[0,b]} \text{ aims at } q \,\}.
\]

We have just shown that $\varepsilon \in \mathcal{S}$. Let $A = \sup\mathcal{S} > 0$. By continuity of the
distance function, it is easy to see that $\mathcal{S}$ is closed, and therefore $A \in \mathcal{S}$.
If $A = T$, then $\gamma|_{[0,T]}$ is a geodesic of length $T = d(p,q)$ that aims at $q$,
and by the remark above we are done. So we assume $A < T$ and derive a
contradiction.

Let $y = \gamma(A)$, and choose $\delta > 0$ such that $\bar B_\delta(y)$ is a closed geodesic ball
(Figure 6.16). The fact that $A \in \mathcal{S}$ means

\[
d(y,q) = d(p,q) - d(p,y) = T - A.
\]

Let $z \in S_\delta(y)$ be a point where $d(z,q)$ attains its minimum, and let
$\tau\colon [0,\delta] \to M$ be the radial geodesic from $y$ to $z$. By exactly the same
argument as before, $\tau$ aims at $q$, so

\[
d(z,q) = d(y,q) - d(y,z) = (T - A) - \delta. \tag{6.7}
\]

By the triangle inequality and (6.7),

\[
\begin{aligned}
d(p,z) &\ge d(p,q) - d(z,q) \\
  &= T - (T - A - \delta) = A + \delta.
\end{aligned}
\]

Therefore, the admissible curve consisting of $\gamma|_{[0,A]}$ (of length $A$) followed
by $\tau$ (of length $\delta$) is a minimizing curve from $p$ to $z$. This means it has no
corners, so $z$ must lie on $\gamma$, and in fact $z = \gamma(A + \delta)$. But then (6.7) says

\[
d(p,q) = T = (A + \delta) + d(z,q) = d(p,z) + d(z,q),
\]

so $\gamma|_{[0,A+\delta]}$ aims at $q$ and $A + \delta \in \mathcal{S}$, which is a contradiction. This completes
the proof that there is a minimizing geodesic from $p$ to $q$.

Finally, we need to show that Cauchy sequences converge. Let $\{q_i\}$ be a
Cauchy sequence in $M$. For each $i$, let $\gamma_i(t) = \exp_p(tV_i)$ be a unit speed
minimizing geodesic from $p$ to $q_i$, and let $d_i = d(p,q_i)$, so that $q_i = \exp_p(d_iV_i)$
(Figure 6.17). The sequence $\{d_i\}$ is bounded in $\mathbf{R}$ (because Cauchy
sequences in any metric space are bounded), and the sequence $\{V_i\}$ consists
of unit vectors in $T_pM$, so the sequence of vectors $\{d_iV_i\}$ in $T_pM$ is bounded.
Therefore a subsequence $\{d_{i_k}V_{i_k}\}$ converges to $V \in T_pM$. By continuity of
the exponential map, $q_{i_k} = \exp_p(d_{i_k}V_{i_k}) \to \exp_p V$, and since the original
sequence $\{q_i\}$ is Cauchy, it converges to the same limit. This completes the
proof of the Hopf-Rinow theorem. □

FIGURE 6.17. Cauchy sequences converge.

Because of this theorem, a connected Riemannian manifold is simply 
said to be complete if it is complete in either of the two equivalent senses 
discussed above. Complete manifolds are the natural setting for global
questions in Riemannian geometry.

Exercise 6.5. Show that $\mathbf{R}^n$, $\mathbf{H}^n_R$, and $\mathbf{S}^n_R$ are complete.

We conclude this chapter by stating three important corollaries, whose 
proofs are immediate. The first two are corollaries of the proof of the Hopf- 
Rinow theorem, while the last one follows from its statement. In all of these 
corollaries, M is assumed to be a connected Riemannian manifold. 

Corollary 6.14. If there exists one point $p \in M$ such that the restricted
exponential map $\exp_p$ is defined on all of $T_pM$, then $M$ is complete.

Corollary 6.15. If $M$ is complete, then any two points in $M$ can be
joined by a minimizing geodesic segment. (The converse fails: any two points
of an open ball in $\mathbf{R}^n$ are joined by a minimizing segment, yet the ball is
not complete.)

Corollary 6.16. If $M$ is compact, then every geodesic can be defined for
all time.

Problems


6-1. Define a connection on $\mathbf{R}^3$ by setting

\[
\begin{aligned}
\Gamma^3_{12} = \Gamma^1_{23} = \Gamma^2_{31} &= 1, \\
\Gamma^3_{21} = \Gamma^1_{32} = \Gamma^2_{13} &= -1,
\end{aligned}
\]

and all other Christoffel symbols to zero. Show that this connection is
compatible with the Euclidean metric and has minimizing geodesics,
but is not symmetric.

6-2. We now have two kinds of “metrics” on a Riemannian manifold—the 
Riemannian metric and the distance function. Correspondingly, there 
are two definitions of “isometry” between Riemannian manifolds—a 
Riemannian isometry is a diffeomorphism that pulls one Riemannian 
metric back to the other, and a metric isometry is a homeomorphism 
that pulls one distance function back to the other. Prove that these 
two kinds of isometry are identical. [Hint: For the hard direction, first 
use the exponential map to show the homeomorphism is smooth.] 

6-3. Suppose $M$ and $\tilde M$ are Riemannian manifolds (not necessarily
complete), and $\varphi_i\colon M \to \tilde M$ are Riemannian isometries that converge
uniformly to a map $\varphi\colon M \to \tilde M$. (This means that for any $\varepsilon > 0$,
there exists $I$ such that $d(\varphi_i(p),\varphi(p)) < \varepsilon$ for all $p \in M$ and all
$i \ge I$.) Show that $\varphi$ is a Riemannian isometry.

6-4. A subset $\mathcal{U}$ of a Riemannian manifold $M$ is said to be convex if for
each $p,q \in \mathcal{U}$, there is a unique (in $M$) minimizing geodesic from $p$ to $q$
lying entirely in $\mathcal{U}$. Show that every point has a convex neighborhood,
as follows:

(a) Let $p \in M$ be fixed, and let $\mathcal{W}$ be a uniformly normal
neighborhood of $p$. For $\varepsilon > 0$ small enough that $B_{2\varepsilon}(p) \subset \mathcal{W}$, define a
subset $W_\varepsilon \subset TM \times \mathbf{R}$ by

\[
W_\varepsilon = \{\, (q,V,t) \in TM \times \mathbf{R} :
  q \in B_\varepsilon(p),\ V \in T_qM,\ |V| = 1,\ |t| < 2\varepsilon \,\}.
\]

Define $f\colon W_\varepsilon \to \mathbf{R}$ by

\[
f(q,V,t) = d(\exp_q(tV),p)^2.
\]

Show that $f$ is smooth. [Hint: Use normal coordinates centered
at $p$.]

(b) Show that if $\varepsilon$ is chosen small enough, then $\partial^2 f/\partial t^2 > 0$ on $W_\varepsilon$.
[Hint: Compute $f(p,V,t)$ explicitly and use continuity.]

(c) If $q_1,q_2 \in B_\varepsilon(p)$ and $\gamma$ is a minimizing geodesic from $q_1$ to $q_2$,
show that $d(\gamma(t),p)$ attains its maximum at one of the endpoints
of $\gamma$.

(d) Show that $B_\varepsilon(p)$ is convex.

6-5. If $M$ is a complete Riemannian manifold and $N \subset M$ is a closed,
embedded submanifold with the induced Riemannian metric, show
that N is complete. [Warning: The distance function on N induced 
from the metric space structure of M is not in general equal to the 
Riemannian distance function of N.] 

6-6. A curve $\gamma\colon [0,b) \to M$ ($0 < b \le \infty$) is said to converge to infinity if
for every compact set $K \subset M$, there is a time $T \in [0,b)$ such that
$\gamma(t) \notin K$ for $t > T$. (This means that $\gamma$ converges to the “point
at infinity” in the one-point compactification of M.) Prove that a 
Riemannian manifold is complete if and only if every regular curve 
that converges to infinity has infinite length. (The length of a curve 
whose domain is not compact is just the supremum of the lengths of 
its restrictions to compact subintervals.) 

6-7. Show that any homogeneous Riemannian manifold is complete. 

6-8. Suppose $M$ is a complete Riemannian manifold that is isotropic at
each point (see page 33). Show that $M$ is homogeneous. [Hint: Given
$p,q \in M$, consider the midpoint of a geodesic joining $p$ and $q$.]

6-9. Generalize the first variation formula (Proposition 6.5) to the case of a
variation that is not proper.

6-10. Let $N$ be a closed, embedded submanifold of a Riemannian manifold
$M$. For any point $p \in M - N$, we define the distance from $p$ to $N$ to
be

\[
d(p,N) := \inf\{\, d(p,x) : x \in N \,\}.
\]

If $q \in N$ is a point such that $d(p,q) = d(p,N)$, and $\gamma$ is any minimizing
geodesic from $p$ to $q$, prove that $\gamma$ intersects $N$ orthogonally. [Hint:
Use Problem 6-9.]

6-11. Suppose $\tilde M$ and $M$ are Riemannian manifolds, and $p\colon \tilde M \to M$ is a
smooth covering map that is also a local isometry. If either $\tilde M$ or $M$
is complete, show that the other is also.

Curvature


In this chapter, we begin our study of the local invariants of Riemannian
metrics. Starting with the question of whether all Riemannian metrics
are locally isometric, we are led to a definition of the Riemannian curvature
tensor as a measure of the failure of second covariant derivatives to
commute. Then we prove the main result of this chapter: A manifold has
zero curvature if and only if it is flat, that is, locally isometric to Euclidean
space. At the end of the chapter, we derive the basic symmetries of the
curvature tensor, and introduce the Ricci and scalar curvatures. The results
of this chapter apply essentially unchanged to pseudo-Riemannian metrics.


Local Invariants 

An important question about Riemannian manifolds is the following: Are
they all locally isometric (i.e., given Riemannian $n$-manifolds $M$, $\tilde M$ and
points $p \in M$ and $\tilde p \in \tilde M$, is there necessarily an isometry from a
neighborhood of $p$ to a neighborhood of $\tilde p$)? Or are there nontrivial local invariants
that must be preserved by isometries? This is not an idle question, since
many interesting and useful structures in differential geometry do not have
local invariants. Some examples are as follows:

• Nonvanishing vector fields. In suitable coordinates, every nonvanishing
vector field can be written locally as $V = \partial/\partial x^1$, so they are all
locally equivalent.

• Riemannian metrics on a 1-manifold. If $\gamma\colon I \to M$ is a local unit
speed parametrization of a Riemannian 1-manifold, then $s = \gamma^{-1}$
gives a coordinate chart in which the metric has the expression $g = ds^2$.
Thus every Riemannian 1-manifold is locally isometric to $\mathbf{R}$.

• Symplectic forms. A symplectic form is a closed 2-form $\omega$ that is
nondegenerate, i.e., $\omega(X,Y) = 0$ for all $Y \in T_pM$ only if $X = 0$. The
theorem of Darboux states that every symplectic form can be written
in suitable coordinates as $\sum_i dx^i \wedge dy^i$. Thus all symplectic forms on
$2n$-manifolds are locally equivalent.

FIGURE 7.1. Result of parallel translation along the $x^1$-axis and the
$x^2$-coordinate lines.

On the other hand, you have shown in Problem 5-4 that the round
2-sphere and the Euclidean plane are not locally isometric. The key idea of
that problem is that every tangent vector in the plane can be extended to
a parallel vector field, so any Riemannian manifold that is locally isometric
to $\mathbf{R}^2$ must have the same property locally.

Given a Riemannian 2-manifold $M$, there is an obvious way to attempt
to construct such an extension of a vector $Z_p \in T_pM$. Choose any local
coordinates $(x^1,x^2)$ centered at $p$; first parallel translate $Z_p$ along the
$x^1$-axis, and then parallel translate the resulting vectors along the coordinate
lines parallel to the $x^2$-axis (Figure 7.1). The result is a vector field $Z$
that, by construction, is parallel along every $x^2$-coordinate line and along
the $x^1$-axis. The question is whether this vector field is parallel along
$x^1$-coordinate lines other than the $x^1$-axis itself, or in other words, whether
$\nabla_{\partial_1}Z \equiv 0$. Observe that $\nabla_{\partial_1}Z$ vanishes when $x^2 = 0$, so by uniqueness of
parallel translates it would suffice to show that

\[
\nabla_{\partial_2}\nabla_{\partial_1}Z = 0. \tag{7.1}
\]

If we knew that

\[
\nabla_{\partial_2}\nabla_{\partial_1}Z = \nabla_{\partial_1}\nabla_{\partial_2}Z, \tag{7.2}
\]

then (7.1) would follow immediately because $\nabla_{\partial_2}Z = 0$ everywhere by
construction. Indeed, on $\mathbf{R}^2$, direct computation shows

\[
\begin{aligned}
\nabla_{\partial_2}\nabla_{\partial_1}Z &= \nabla_{\partial_2}\bigl(\partial_1Z^k\,\partial_k\bigr) \\
  &= \partial_2\partial_1Z^k\,\partial_k,
\end{aligned}
\]

and $\nabla_{\partial_1}\nabla_{\partial_2}Z$ is equal to the same thing, because ordinary second
partial derivatives commute. However, (7.2) might not hold for an arbitrary
Riemannian metric; indeed, it is precisely the noncommutativity of such
second covariant derivatives that forces this construction to fail on the
sphere. Lurking behind this noncommutativity is the fact that the sphere
is “curved.”
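
A closely related experiment makes the failure on the sphere visible
numerically. The sketch below is an illustration under assumptions that go
beyond the text: instead of the coordinate-line construction above, it
parallel translates a vector once around a closed latitude circle on the unit
sphere, using spherical coordinates $(\theta,\varphi)$ and the standard Christoffel symbols
of the round metric $d\theta^2 + \sin^2\theta\,d\varphi^2$, namely $\Gamma^\theta_{\varphi\varphi} = -\sin\theta\cos\theta$ and
$\Gamma^\varphi_{\theta\varphi} = \Gamma^\varphi_{\varphi\theta} = \cot\theta$. The function name and the Runge-Kutta integrator
are choices made here for illustration. If a parallel extension existed, the
vector would return to itself; on the sphere it generally does not.

```python
import math

def transport_along_latitude(theta0, z0, steps=2000):
    """Parallel translate z0 = (Z^theta, Z^phi) once around the circle theta = theta0.

    Integrates dZ^k/dphi = -Gamma^k_{phi j} Z^j with the classical RK4 scheme.
    """
    s, c = math.sin(theta0), math.cos(theta0)

    def rhs(z):
        zt, zp = z
        # -Gamma^theta_{phi phi} Z^phi = sin*cos*Z^phi;  -Gamma^phi_{phi theta} Z^theta = -cot*Z^theta
        return (s * c * zp, -(c / s) * zt)

    h = 2.0 * math.pi / steps
    z = z0
    for _ in range(steps):
        k1 = rhs(z)
        k2 = rhs((z[0] + 0.5 * h * k1[0], z[1] + 0.5 * h * k1[1]))
        k3 = rhs((z[0] + 0.5 * h * k2[0], z[1] + 0.5 * h * k2[1]))
        k4 = rhs((z[0] + h * k3[0], z[1] + h * k3[1]))
        z = (z[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
             z[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)
    return z

if __name__ == "__main__":
    # Start with the unit vector d/dtheta at colatitude pi/3.
    print(transport_along_latitude(math.pi / 3.0, (1.0, 0.0)))
```

Solving the linear system by hand shows that in the orthonormal frame
$(\partial_\theta, \partial_\varphi/\sin\theta)$ the vector rotates through the angle $2\pi\cos\theta_0$, so at
$\theta_0 = \pi/3$ it comes back reversed, while on the equator ($\theta_0 = \pi/2$, a geodesic)
it comes back unchanged.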

To express this noncommutativity in a coordinate-invariant way, let’s
look more closely at the quantity $\nabla_X\nabla_YZ - \nabla_Y\nabla_XZ$. On the Euclidean
plane, we just showed that this always vanishes if $X = \partial_1$ and $Y = \partial_2$;
however, for arbitrary vector fields this may no longer be true. In fact, in
$\mathbf{R}^n$ with the Euclidean metric we have

\[
\nabla_X\nabla_YZ = \nabla_X\bigl(YZ^k\,\partial_k\bigr) = XYZ^k\,\partial_k,
\]

and similarly $\nabla_Y\nabla_XZ = YXZ^k\,\partial_k$. The difference between these two
expressions is $(XYZ^k - YXZ^k)\,\partial_k = \nabla_{[X,Y]}Z$. Therefore the following
relation holds for all vector fields $X, Y, Z$ on $\mathbf{R}^n$:

\[
\nabla_X\nabla_YZ - \nabla_Y\nabla_XZ = \nabla_{[X,Y]}Z. \tag{7.3}
\]

By naturality of the Riemannian connection, it must also hold on any
Riemannian manifold that is locally isometric to $\mathbf{R}^n$. We’ll call (7.3) the
flatness criterion.

This motivates the following definition. If $M$ is any Riemannian manifold,
the (Riemann) curvature endomorphism is the map
$R\colon \mathcal{T}(M) \times \mathcal{T}(M) \times \mathcal{T}(M) \to \mathcal{T}(M)$ defined by

\[
R(X,Y)Z = \nabla_X\nabla_YZ - \nabla_Y\nabla_XZ - \nabla_{[X,Y]}Z.
\]

Proposition 7.1. The curvature endomorphism is a $\binom{3}{1}$-tensor field.

Proof. By the tensor characterization lemma, we need only show that $R$
is multilinear over $C^\infty(M)$. It is obviously multilinear over $\mathbf{R}$. For
$f \in C^\infty(M)$,

\[
\begin{aligned}
R(X,fY)Z &= \nabla_X\nabla_{fY}Z - \nabla_{fY}\nabla_XZ - \nabla_{[X,fY]}Z \\
  &= \nabla_X(f\nabla_YZ) - f\nabla_Y\nabla_XZ - \nabla_{f[X,Y]+(Xf)Y}Z \\
  &= (Xf)\nabla_YZ + f\nabla_X\nabla_YZ - f\nabla_Y\nabla_XZ \\
  &\qquad - f\nabla_{[X,Y]}Z - (Xf)\nabla_YZ \\
  &= fR(X,Y)Z.
\end{aligned}
\]

The same proof shows that $R$ is linear over $C^\infty(M)$ in $X$, because
$R(X,Y)Z = -R(Y,X)Z$ from the definition. The remaining case to be
checked is linearity over $C^\infty(M)$ in $Z$; this is left to the reader. □

Exercise 7.1. Prove that $R(X,Y)(fZ) = fR(X,Y)Z$.

As a $\binom{3}{1}$-tensor field, the curvature endomorphism can be written in
terms of any local frame with one upper and three lower indices. We adopt
the convention that the last index is the contravariant (upper) one. (This is
contrary to our default assumption that contravariant indices come first.)
Thus, for example, the curvature endomorphism can be written in terms
of local coordinates $(x^i)$ as

\[
R = R_{ijk}{}^{l}\, dx^i \otimes dx^j \otimes dx^k \otimes \partial_l,
\]

where the coefficients $R_{ijk}{}^{l}$ are defined by

\[
R(\partial_i,\partial_j)\partial_k = R_{ijk}{}^{l}\,\partial_l.
\]

We also define the (Riemann) curvature tensor as the covariant 4-tensor
field $Rm = R^\flat$ obtained from the $\binom{3}{1}$-tensor field $R$ by lowering the last
index. Its action on vector fields is given by

\[
Rm(X,Y,Z,W) = \langle R(X,Y)Z, W\rangle, \tag{7.4}
\]

and in coordinates it is written

\[
Rm = R_{ijkl}\, dx^i \otimes dx^j \otimes dx^k \otimes dx^l,
\]

where $R_{ijkl} = g_{lm}R_{ijk}{}^{m}$.
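
To see these coefficients in a concrete case, one can expand the definition
of $R(\partial_i,\partial_j)\partial_k$ using $\nabla_{\partial_i}\partial_j = \Gamma^m_{ij}\partial_m$ and $[\partial_i,\partial_j] = 0$, which gives
$R_{ijk}{}^{l} = \partial_i\Gamma^l_{jk} - \partial_j\Gamma^l_{ik} + \Gamma^m_{jk}\Gamma^l_{im} - \Gamma^m_{ik}\Gamma^l_{jm}$. The Python sketch below
evaluates this expression by finite differences. It is an illustration under
assumptions not stated in the text: the coordinate formula just derived, the
standard Christoffel symbols of the round unit sphere in coordinates $(\theta,\varphi)$
and of the Euclidean plane in polar coordinates $(r,\theta)$, and hypothetical
function names.

```python
import math

def riemann(gamma, x, i, j, k, l, h=1e-5):
    """Coefficient R_{ijk}^l at the point x, where gamma(x)[l][i][j] = Gamma^l_{ij}."""
    n = len(x)

    def d(a, f):
        # Central difference of the scalar function f in the a-th coordinate.
        xp, xm = list(x), list(x)
        xp[a] += h
        xm[a] -= h
        return (f(xp) - f(xm)) / (2.0 * h)

    G = gamma(x)
    val = d(i, lambda y: gamma(y)[l][j][k]) - d(j, lambda y: gamma(y)[l][i][k])
    for m in range(n):
        val += G[m][j][k] * G[l][i][m] - G[m][i][k] * G[l][j][m]
    return val

def sphere_gamma(x):
    """Christoffel symbols of d(theta)^2 + sin^2(theta) d(phi)^2; index 0 = theta, 1 = phi."""
    t = x[0]
    G = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    G[0][1][1] = -math.sin(t) * math.cos(t)
    G[1][0][1] = G[1][1][0] = math.cos(t) / math.sin(t)
    return G

def polar_gamma(x):
    """Christoffel symbols of dr^2 + r^2 d(theta)^2; index 0 = r, 1 = theta."""
    r = x[0]
    G = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    G[0][1][1] = -r
    G[1][0][1] = G[1][1][0] = 1.0 / r
    return G

if __name__ == "__main__":
    theta = 1.0
    print(riemann(sphere_gamma, [theta, 0.3], 0, 1, 1, 0), math.sin(theta) ** 2)
    print(riemann(polar_gamma, [2.0, 0.3], 0, 1, 1, 0))  # flat metric: close to 0
```

On the sphere the coefficient with $(i,j,k,l) = (\theta,\varphi,\varphi,\theta)$ comes out as $\sin^2\theta$,
whereas in polar coordinates on the plane the same coefficient vanishes, even
though the Christoffel symbols themselves do not.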

It is appropriate to note here that there is much variation in the
literature with respect to the sign conventions adopted in the definitions of the
Riemann curvature endomorphism and curvature tensor. While almost all
authors define the curvature endomorphism as we have, there are a few
(notably [dC92, GHL87]) whose definition is the negative of ours. There is
much less agreement on the sign of the curvature tensor: whichever sign is
chosen for the curvature endomorphism, you will see the curvature tensor
defined as in (7.4) but with various permutations of $(X, Y, Z, W)$ on the
right-hand side. After applying the symmetries of the curvature tensor that
we will prove at the end of this chapter, however, all the definitions agree up
to sign. There are various arguments to support one choice or another; we
have made a choice that makes equation (7.4) easy to remember. You just
have to be careful when you begin reading any book or article to determine
the author’s sign convention.

One reason the curvature endomorphism and curvature tensor are
interesting is shown by the following lemma.

Lemma 7.2. The Riemann curvature endomorphism and curvature tensor
are local isometry invariants. More precisely, if $\varphi\colon (M,g) \to (\tilde M,\tilde g)$ is a
local isometry, then

\[
\begin{aligned}
\varphi^*\widetilde{Rm} &= Rm; \\
\tilde R(\varphi_*X,\varphi_*Y)\varphi_*Z &= \varphi_*\bigl(R(X,Y)Z\bigr).
\end{aligned}
\]

Exercise 7.2. Prove Lemma 7.2.


Flat Manifolds 

To give a qualitative geometric meaning to the curvature tensor, we will
show that it is precisely the obstruction to being locally isometric to
Euclidean space. (In Chapter 8, after we have developed more machinery, we
will be able to give a far more detailed quantitative interpretation.)

A Riemannian manifold is said to be flat if it is locally isometric to
Euclidean space, that is, if every point has a neighborhood that is isometric
to an open set in $\mathbf{R}^n$ with its Euclidean metric.

Theorem 7.3. A Riemannian manifold is flat if and only if its curvature 
tensor vanishes identically. 

Proof. One direction is immediate: we showed above that the Euclidean
metric satisfies the flatness criterion (7.3). Thus its curvature
endomorphism is identically zero, and hence so also is its curvature tensor. If $(M,g)$
is flat, in a neighborhood of any point there is an isometry $\varphi$ to an open
set in $(\mathbf{R}^n,\bar g)$, and Lemma 7.2 shows that the curvature tensor of $g$ is the
pullback of that of $\bar g$, and thus is zero.

Now suppose $(M,g)$ has vanishing curvature tensor. This means that
the curvature endomorphism vanishes as well, so the flatness criterion (7.3)
holds for all vector fields on $M$. We begin by showing that $g$ shares one
important property with the Euclidean metric: $g$ admits a parallel
orthonormal frame in a neighborhood of any point.

FIGURE 7.2. Proof that zero curvature implies flatness.


Let $p \in M$, and choose any orthonormal basis $(E_1|_p, \dots, E_n|_p)$
for $T_pM$. Let $(x^i)$ be any coordinates centered at $p$ such that
$E_i|_p = \partial_i$ (for example, normal coordinates would suffice). By
shrinking the coordinate neighborhood if necessary, we may assume that the
image of the coordinate chart is a cube
$C_\varepsilon = \{x : |x^i| < \varepsilon,\ i = 1, \dots, n\}$.

Begin by parallel translating each vector $E_j|_p$ along the $x^1$-axis; then
from each point on the $x^1$-axis, parallel translate along the coordinate
line parallel to the $x^2$-axis; then successively parallel translate along
coordinate lines parallel to the $x^3$ through $x^n$-axes (Figure 7.2). The
result is $n$ vector fields $(E_1, \dots, E_n)$ defined in $C_\varepsilon$.
The fact that the resulting vector fields are smooth follows from an
inductive application of the theorem concerning smooth dependence of
solutions to ODEs on initial conditions [Boo86, Theorem IV.4.2]; the details
are left to the reader.

Because parallel translation preserves inner products, it is easy to see
that the vector fields $\{E_j\}$ form an orthonormal frame. Since
$\nabla_X E_j$ is linear over $C^\infty(M)$ in $X$, to show that the frame is
parallel it suffices to show that $\nabla_{\partial_i} E_j = 0$ for each $i$
and $j$.

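Fix $j$. By construction, $\nabla_{\partial_1} E_j = 0$ on the $x^1$-axis,
$\nabla_{\partial_2} E_j = 0$ on the $(x^1,x^2)$-plane, and in general
$\nabla_{\partial_k} E_j = 0$ on the slice $M_k \subset C_\varepsilon$
defined by $x^{k+1} = \dots = x^n = 0$. We prove the following fact by
induction on $k$:

$$\nabla_{\partial_1} E_j = \dots = \nabla_{\partial_k} E_j = 0 \quad\text{on } M_k. \tag{7.5}$$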

For $k = 1$, this is true by construction, and for $k = n$, it means that
$E_j$ is parallel on the whole cube $C_\varepsilon$. So assume that (7.5)
holds for some $k$. On $M_{k+1}$, $\nabla_{\partial_{k+1}} E_j = 0$ by
construction, and for $i \le k$, $\nabla_{\partial_i} E_j = 0$ on the
hyperplane where $x^{k+1} = 0$ by the inductive hypothesis. So it suffices to
show that $\nabla_{\partial_{k+1}}(\nabla_{\partial_i} E_j) = 0$. Since
$[\partial_{k+1}, \partial_i] = 0$, the flatness criterion gives

$$\nabla_{\partial_{k+1}}(\nabla_{\partial_i} E_j) = \nabla_{\partial_i}(\nabla_{\partial_{k+1}} E_j) = 0,$$

which completes the inductive step to show that the $E_j$'s are parallel.
Because the Riemannian connection is symmetric, we have

$$[E_i, E_j] = \nabla_{E_i} E_j - \nabla_{E_j} E_i = 0.$$

Thus the vector fields $(E_1, \dots, E_n)$ form a commuting orthonormal frame
on $C_\varepsilon$. An important fact from elementary differential geometry
is the following "normal form for commuting vector fields": If
$(E_1, \dots, E_n)$ are commuting independent vector fields on a neighborhood
of $p \in M$, there are coordinates $(y^i)$ on a (possibly smaller)
neighborhood of $p$ such that $E_i = \partial/\partial y^i$. (See
[Boo86, p. 161], where the proof of this normal form is a key step in the
proof of the Frobenius theorem.) In any such coordinates,
$g_{ij} = g(\partial_i, \partial_j) = g(E_i, E_j) = \delta_{ij}$, so the map
$y = (y^1, \dots, y^n)$ is an isometry from a neighborhood of $p$ to an open
subset of Euclidean space. □
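
As a quick check of the easy direction of Theorem 7.3 (an added sketch, not part of the original proof), consider the punctured plane with the metric $g = dr^2 + r^2\,d\theta^2$, which is the Euclidean metric written in polar coordinates. The Christoffel symbols quoted in the comment are the standard ones for this metric and are taken as an assumption here; the computation uses $R(X,Y)Z = \nabla_X\nabla_Y Z - \nabla_Y\nabla_X Z - \nabla_{[X,Y]}Z$ together with $[\partial_r, \partial_\theta] = 0$.

```latex
% Assumed: nonzero Christoffel symbols of g = dr^2 + r^2 d\theta^2 are
%   \Gamma^r_{\theta\theta} = -r, \qquad
%   \Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = 1/r.
\begin{align*}
\nabla_{\partial_r}\partial_r &= 0, \qquad
\nabla_{\partial_r}\partial_\theta = \nabla_{\partial_\theta}\partial_r
   = \tfrac{1}{r}\,\partial_\theta, \qquad
\nabla_{\partial_\theta}\partial_\theta = -r\,\partial_r, \\
R(\partial_r,\partial_\theta)\partial_\theta
  &= \nabla_{\partial_r}(-r\,\partial_r)
     - \nabla_{\partial_\theta}\bigl(\tfrac{1}{r}\,\partial_\theta\bigr)
   = -\partial_r - \tfrac{1}{r}(-r\,\partial_r) = 0, \\
R(\partial_r,\partial_\theta)\partial_r
  &= \nabla_{\partial_r}\bigl(\tfrac{1}{r}\,\partial_\theta\bigr)
     - \nabla_{\partial_\theta}(0)
   = -\tfrac{1}{r^2}\,\partial_\theta
     + \tfrac{1}{r}\cdot\tfrac{1}{r}\,\partial_\theta = 0.
\end{align*}
```

Both values vanish even though the Christoffel symbols do not, which is consistent with the theorem: the Cartesian coordinates $x = r\cos\theta$, $y = r\sin\theta$ are coordinates in which $g_{ij} = \delta_{ij}$.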

Exercise 7.3. Prove that the vector fields $\{E_j\}$ constructed in the
preceding proof are smooth.

Exercise 7.4. Prove (or look up) the normal form theorem used in the
preceding proof.


Symmetries of the Curvature Tensor 

The curvature tensor on a Riemannian manifold has a number of symmetries
besides the obvious skew-symmetry in its first two arguments.

Proposition 7.4. (Symmetries of the Curvature Tensor) The curvature
tensor has the following symmetries for any vector fields $W, X, Y, Z$:

(a) $Rm(W,X,Y,Z) = -Rm(X,W,Y,Z)$.

(b) $Rm(W,X,Y,Z) = -Rm(W,X,Z,Y)$.

(c) $Rm(W,X,Y,Z) = Rm(Y,Z,W,X)$.

(d) $Rm(W,X,Y,Z) + Rm(X,Y,W,Z) + Rm(Y,W,X,Z) = 0$.

Before we begin the proof, a few remarks are in order. First, as the proof
will show, (a) is a trivial consequence of the definition of the curvature
endomorphism; (b) follows from the compatibility of the Riemannian
connection with the metric; (d) follows from the symmetry of the connection;
and (c) follows from (a), (b), and (d). The symmetry expressed in (d) is
called the algebraic Bianchi identity (or, more traditionally but less
informatively, the first Bianchi identity). It is easy to show using (a)-(d)
that a three-term sum obtained by cyclically permuting any three indices of
$Rm$ is also zero. Finally, it is useful to record the form of these
symmetries in terms of components with respect to any basis:

(a′) $R_{ijkl} = -R_{jikl}$.

(b′) $R_{ijkl} = -R_{ijlk}$.

(c′) $R_{ijkl} = R_{klij}$.

(d′) $R_{ijkl} + R_{jkil} + R_{kijl} = 0$.
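
To see how restrictive these symmetries are, here is a small added illustration (an assumption-labeled sketch, not from the original text): suppose $\dim M = 2$ and $(E_1, E_2)$ is any basis. Symmetries (a′) and (b′) force every component with $i = j$ or $k = l$ to vanish, so the only components that can be nonzero are related as follows.

```latex
\[
R_{1221} = -R_{2121} = -R_{1212} = R_{2112}.
\]
% Every other component R_{ijkl} has i = j or k = l and is therefore zero.
```

Thus in dimension 2 the curvature tensor is determined by the single number $R_{1221}$. In this case (c′) and (d′) follow from (a′) and (b′) and impose no further conditions.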

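Proof of Proposition 7.4. Identity (a) is immediate from the obvious fact
that $R(W,X)Y = -R(X,W)Y$. To prove (b), it suffices to show that
$Rm(W,X,Y,Y) = 0$ for all $Y$, for then (b) follows from the expansion
of $Rm(W,X,Y+Z,Y+Z) = 0$. Using compatibility with the metric, we
have

$$\begin{aligned}
WX|Y|^2 &= W(2\langle \nabla_X Y, Y\rangle) = 2\langle \nabla_W\nabla_X Y, Y\rangle + 2\langle \nabla_X Y, \nabla_W Y\rangle; \\
XW|Y|^2 &= X(2\langle \nabla_W Y, Y\rangle) = 2\langle \nabla_X\nabla_W Y, Y\rangle + 2\langle \nabla_W Y, \nabla_X Y\rangle; \\
[W,X]|Y|^2 &= 2\langle \nabla_{[W,X]} Y, Y\rangle.
\end{aligned}$$

When we subtract the second and third equations from the first, the
left-hand side is zero. The terms $2\langle \nabla_X Y, \nabla_W Y\rangle$
and $2\langle \nabla_W Y, \nabla_X Y\rangle$ cancel on the right-hand side,
giving

$$\begin{aligned}
0 &= 2\langle \nabla_W\nabla_X Y, Y\rangle - 2\langle \nabla_X\nabla_W Y, Y\rangle - 2\langle \nabla_{[W,X]} Y, Y\rangle \\
  &= 2\langle R(W,X)Y, Y\rangle \\
  &= 2Rm(W,X,Y,Y).
\end{aligned}$$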

Next we prove (d). From the definition of $Rm$, this will follow
immediately from

$$R(W,X)Y + R(X,Y)W + R(Y,W)X = 0.$$

Using the definition of $R$ and the symmetry of the connection, the left-hand
side expands to

$$\begin{aligned}
&(\nabla_W\nabla_X Y - \nabla_X\nabla_W Y - \nabla_{[W,X]}Y) \\
&\quad + (\nabla_X\nabla_Y W - \nabla_Y\nabla_X W - \nabla_{[X,Y]}W) \\
&\quad + (\nabla_Y\nabla_W X - \nabla_W\nabla_Y X - \nabla_{[Y,W]}X) \\
&= \nabla_W(\nabla_X Y - \nabla_Y X) + \nabla_X(\nabla_Y W - \nabla_W Y) + \nabla_Y(\nabla_W X - \nabla_X W) \\
&\quad - \nabla_{[W,X]}Y - \nabla_{[X,Y]}W - \nabla_{[Y,W]}X \\
&= \nabla_W[X,Y] + \nabla_X[Y,W] + \nabla_Y[W,X] \\
&\quad - \nabla_{[W,X]}Y - \nabla_{[X,Y]}W - \nabla_{[Y,W]}X \\
&= [W,[X,Y]] + [X,[Y,W]] + [Y,[W,X]].
\end{aligned}$$

This is zero by the Jacobi identity.

Finally, we show that identity (c) follows from the other three. Writing
the algebraic Bianchi identity four times with indices cyclically permuted
gives

$$\begin{aligned}
Rm(W,X,Y,Z) + Rm(X,Y,W,Z) + Rm(Y,W,X,Z) &= 0 \\
Rm(X,Y,Z,W) + Rm(Y,Z,X,W) + Rm(Z,X,Y,W) &= 0 \\
Rm(Y,Z,W,X) + Rm(Z,W,Y,X) + Rm(W,Y,Z,X) &= 0 \\
Rm(Z,W,X,Y) + Rm(W,X,Z,Y) + Rm(X,Z,W,Y) &= 0.
\end{aligned}$$

Now add up all four equations. Applying (b) four times makes all the
terms in the first two columns cancel. Then applying (a) and (b) in the last
column yields $2Rm(Y,W,X,Z) - 2Rm(X,Z,Y,W) = 0$, which is equivalent
to (c). □

There is one more identity that is satisfied by the covariant derivatives of 
the curvature tensor on any Riemannian manifold. Classically, it is called 
the second Bianchi identity, but modern authors tend to use the more 
informative name differential Bianchi identity. 

Proposition 7.5. (Differential Bianchi Identity) The total covariant
derivative of the curvature tensor satisfies the following identity:

$$\nabla Rm(X,Y,Z,V,W) + \nabla Rm(X,Y,V,W,Z) + \nabla Rm(X,Y,W,Z,V) = 0. \tag{7.6}$$

In components, this is

$$R_{ijkl;m} + R_{ijlm;k} + R_{ijmk;l} = 0. \tag{7.7}$$

Proof. First of all, by the symmetries of $Rm$, (7.6) is equivalent to

$$\nabla Rm(Z,V,X,Y,W) + \nabla Rm(V,W,X,Y,Z) + \nabla Rm(W,Z,X,Y,V) = 0. \tag{7.8}$$

This can be proved by a long and tedious computation, but there is a 
standard shortcut for such calculations in Riemannian geometry that makes 
our task immeasurably easier. To prove (7.6) holds at a particular point 
p, by multilinearity it suffices to prove the formula when X, Y, Z, V, W are 
basis elements with respect to some frame. The shortcut consists of choosing 
a special frame for each point p to simplify the computations there. 

Let $(x^i)$ be normal coordinates at $p$, and let $X, Y, Z, V, W$ be
arbitrary coordinate basis vectors $\partial_i$. These vectors satisfy two
properties that simplify our computations enormously: (1) their commutators
vanish identically, since $[\partial_i, \partial_j] = 0$; and (2) their
covariant derivatives vanish at $p$, since $\Gamma^k_{ij}(p) = 0$
(Proposition 5.11(f)).

Using these facts and the compatibility of the connection with the metric,
the first term in (7.8) evaluated at $p$ becomes

$$\begin{aligned}
\nabla_W Rm(Z,V,X,Y) &= \nabla_W \langle R(Z,V)X, Y\rangle \\
&= \langle \nabla_W\nabla_Z\nabla_V X - \nabla_W\nabla_V\nabla_Z X, Y\rangle.
\end{aligned}$$

Write this equation three times, with the vector fields $W, Z, V$ cyclically
permuted. Summing all three gives

$$\begin{aligned}
&\nabla Rm(Z,V,X,Y,W) + \nabla Rm(V,W,X,Y,Z) + \nabla Rm(W,Z,X,Y,V) \\
&\quad = \langle \nabla_W\nabla_Z\nabla_V X - \nabla_W\nabla_V\nabla_Z X \\
&\qquad + \nabla_Z\nabla_V\nabla_W X - \nabla_Z\nabla_W\nabla_V X \\
&\qquad + \nabla_V\nabla_W\nabla_Z X - \nabla_V\nabla_Z\nabla_W X, Y\rangle \\
&\quad = \langle R(W,Z)\nabla_V X + R(Z,V)\nabla_W X + R(V,W)\nabla_Z X, Y\rangle \\
&\quad = 0,
\end{aligned}$$

where the last line follows because $\nabla_V X = \nabla_W X = \nabla_Z X = 0$
at $p$. □


Ricci and Scalar Curvatures 

Because 4-tensors are so complicated, it is often useful to construct simpler
tensors that summarize some of the information contained in the curvature
tensor. The most important such tensor is the Ricci curvature or Ricci
tensor, denoted $Rc$ (or often $Ric$ in the literature), which is the
covariant 2-tensor field defined as the trace of the curvature endomorphism
on its first and last indices. The components of $Rc$ are usually denoted
$R_{ij}$, so that

$$R_{ij} := R_{kij}{}^{k} = g^{km} R_{kijm}.$$

The scalar curvature is the function $S$ defined as the trace of the Ricci
tensor:

$$S := \operatorname{tr}_g Rc = R_i{}^{i} = g^{ij} R_{ij}.$$
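
As an added illustration of these two traces (a sketch under a stated assumption, not part of the original text), suppose the curvature tensor of an $n$-dimensional metric $g$ has the special form $Rm(X,Y,Z,W) = c\,(\langle X,W\rangle\langle Y,Z\rangle - \langle X,Z\rangle\langle Y,W\rangle)$ for some constant $c$. The constant $c$ and this form of $Rm$ are hypothetical inputs; the computation only applies the definitions above.

```latex
\begin{align*}
R_{ijkl} &= c\,(g_{il}\,g_{jk} - g_{ik}\,g_{jl}), \\
R_{ij} &= g^{km} R_{kijm}
        = c\,g^{km}\,(g_{km}\,g_{ij} - g_{kj}\,g_{im})
        = c\,(n\,g_{ij} - g_{ij})
        = (n-1)\,c\,g_{ij}, \\
S &= g^{ij} R_{ij} = n(n-1)\,c.
\end{align*}
```

Under this assumption the Ricci tensor is a constant multiple of the metric and the scalar curvature is constant.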


Lemma 7.6. The Ricci curvature is a symmetric 2-tensor field. It can be
expressed in any of the following ways:

$$R_{ij} = R_{kij}{}^{k} = R_{ik}{}^{k}{}_{j} = -R_{ki}{}^{k}{}_{j} = -R_{ikj}{}^{k}.$$


Exercise 7.5. Prove Lemma 7.6, using the symmetries of the curvature
tensor.

Lemma 7.7. (Contracted Bianchi Identity) The covariant derivatives
of the Ricci and scalar curvatures satisfy the following identity:

$$\operatorname{div} Rc = \tfrac{1}{2}\nabla S,$$

where div is the divergence operator (Problem 3-3). In components, this is

$$R_{ij;}{}^{j} = \tfrac{1}{2} S_{;i}. \tag{7.9}$$

Proof. Formula (7.9) follows immediately by contracting the component
form (7.7) of the differential Bianchi identity on the indices $i, l$ and
then again on $j, k$, after raising one index of each pair. □
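
The double contraction in this proof can be written out as follows. This is an added sketch: it uses only the component form of the differential Bianchi identity, the fact that $\nabla g = 0$ (so contracting with $g^{il}$ or $g^{jk}$ commutes with covariant differentiation), and the two trace formulas $g^{il}R_{ijlm} = -R_{jm}$ and $g^{jk}R_{ijmk} = -R_{im}$, which are restatements of Lemma 7.6.

```latex
\begin{align*}
0 &= g^{il} g^{jk}\bigl(R_{ijkl;m} + R_{ijlm;k} + R_{ijmk;l}\bigr) \\
  &= g^{jk} R_{jk;m} \;-\; g^{jk} R_{jm;k} \;-\; g^{il} R_{im;l} \\
  &= S_{;m} - 2\,R_{jm;}{}^{j}.
\end{align*}
% Hence R_{mj;}{}^{j} = \tfrac12 S_{;m}, using the symmetry of Rc;
% this is the component form of div Rc = (1/2) \nabla S.
```

The first term is the trace of $Rc$ differentiated once, and the other two terms each contribute one copy of the divergence of $Rc$ with a minus sign, which is where the factor $\tfrac12$ comes from.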

It is important to note that if the sign convention chosen for the curvature 
tensor is the opposite of ours, then the Ricci tensor must be defined as the 
trace of Rm on the first and third (or second and fourth) indices. (Of course 
the trace on the first two or last two indices is always zero by antisymmetry.) 
The definition is chosen so that the Ricci and scalar curvatures have the 
same meaning for everyone, regardless of the conventions chosen for the 
full curvature tensor. So, for example, if a manifold is said to have positive 
scalar curvature, there is no ambiguity as to what is meant. 

A Riemannian metric is said to be an Einstein metric if its Ricci tensor
is a scalar multiple of the metric at each point, that is, for some function
$\lambda$, $Rc = \lambda g$ everywhere. Taking traces of both sides and
noting that

$$\operatorname{tr}_g g = g_{ij} g^{ji} = \delta_i^i = \dim M,$$

we find that $\lambda = S/n$ (where $n = \dim M$). Thus the Einstein
condition can be written

$$Rc = \tfrac{1}{n} S g. \tag{7.10}$$

Proposition 7.8. If $g$ is an Einstein metric on a connected manifold of
dimension $n \ge 3$, its scalar curvature is constant.

Proof. Taking the covariant derivative of each side of (7.10) and noting
that the covariant derivative of the metric is zero, we see that the Einstein
condition implies

$$R_{ij;k} = \tfrac{1}{n} S_{;k}\, g_{ij}.$$

Tracing this equation on $j$ and $k$, and comparing with the contracted
Bianchi identity (7.9), we conclude

$$\tfrac{1}{2} S_{;i} = \tfrac{1}{n} S_{;i}.$$

When $n > 2$, this implies $S_{;i} = 0$. But $S_{;i}$ is the component of
$\nabla S = dS$, so connectedness of $M$ implies $S$ is constant. □

By an argument analogous to those of Chapter 5, Hilbert showed (see
[Bes87, Theorem 4.21]) that Einstein metrics are critical points for the total
scalar curvature functional $\mathcal{S}(g) := \int_M S\,dV$ on the space of
all metrics on $M$ with fixed volume. Thus Einstein metrics can be viewed as
"optimal" metrics in a certain sense, and as such they form an appealing
higher-dimensional analogue of the metrics of constant Gaussian curvature on
2-manifolds, with which one might hope to prove some sort of generalization
of the uniformization theorem (Theorem 1.7 of Chapter 1). Although
the statement of such a theorem cannot be as elegant as that of its
2-dimensional ancestor because there are known examples of smooth, compact
manifolds that admit no Einstein metrics [Bes87, chapter 6], there is
still a reasonable hope that "most" higher-dimensional manifolds (in some
sense) admit Einstein metrics. This is an active and wide-open field of
current research. See [Bes87] for a sweeping survey of recent research on
Einstein metrics.
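
The dimension restriction in Proposition 7.8 cannot be dropped, and the 2-dimensional case mentioned above shows why. As an added illustration (not part of the original text), assume $\dim M = 2$ and let $(E_1, E_2)$ be a local orthonormal frame, so that $R_{ij} = \sum_k R_{kijk}$. By the symmetries of the curvature tensor, components with equal first two or equal last two indices vanish, and $R_{2112} = R_{1221}$.

```latex
\begin{align*}
R_{11} &= R_{1111} + R_{2112} = R_{1221}, \\
R_{22} &= R_{1221} + R_{2222} = R_{1221}, \\
R_{12} &= R_{1121} + R_{2122} = 0, \\
S &= R_{11} + R_{22} = 2R_{1221},
\qquad\text{so}\qquad Rc = \tfrac{1}{2} S\, g .
\end{align*}
```

So every 2-dimensional Riemannian metric satisfies the Einstein condition with $n = 2$, whether or not $S$ is constant. This matches the proof above, whose final equation $\tfrac12 S_{;i} = \tfrac1n S_{;i}$ carries no information when $n = 2$.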

The term "Einstein metric" originated, as you might guess, in physics:
The central assertion of Einstein's general theory of relativity is that
physical space-time is modeled by a 4-manifold that carries a Lorentz metric
whose Ricci curvature satisfies the following Einstein field equation:

$$Rc - \tfrac{1}{2} S g = T, \tag{7.11}$$

where T is a certain symmetric 2-tensor (the stress-energy tensor) that 
describes the density, momentum, and stress of the matter and energy 
present at each point in space-time. It is shown in physics books (e.g. 
[HE73]) that (7.11) is the variational equation of a certain functional, called 
the Hilbert action, on the space of all Lorentz metrics on a given 4-manifold. 
Einstein’s theory can then be interpreted as the assertion that a physically 
realistic space-time must be a critical point for this functional. 

In the special case when $T = 0$, (7.11) reduces to the vacuum Einstein
field equation $Rc = \tfrac{1}{2} S g$. Taking traces of both sides and
recalling that $\operatorname{tr}_g g = \dim M = 4$, we obtain $S = 2S$,
which implies $S = 0$. Therefore the vacuum Einstein equation is equivalent
to $Rc = 0$, which means that $g$ is a (pseudo-Riemannian) Einstein metric in
the mathematical sense of the word. (At one point in the development of the
theory, Einstein considered adding a term $\Lambda g$ to the left-hand side
of (7.11), where $\Lambda$ is a constant that he called the cosmological
constant. With this modification the vacuum Einstein field equation would be
exactly the same as the mathematicians' Einstein equation. Einstein soon
decided, however, that the cosmological constant was a mistake on physical
grounds.)
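
The claim about the cosmological term can be checked by the same trace argument. The following is an added sketch, written for a general dimension $n \ne 2$ as an assumption for illustration; the physical case is $n = 4$.

```latex
\begin{align*}
Rc - \tfrac{1}{2} S g + \Lambda g &= 0
  && \text{(vacuum equation with cosmological term)} \\
S - \tfrac{n}{2} S + n\Lambda &= 0
  && \text{(trace, using } \operatorname{tr}_g g = n\text{)} \\
S &= \tfrac{2n}{n-2}\,\Lambda, \qquad
Rc = \bigl(\tfrac{1}{2} S - \Lambda\bigr) g = \tfrac{2}{n-2}\,\Lambda\, g .
\end{align*}
% For n = 4 this gives S = 4\Lambda and Rc = \Lambda g.
```

So with the cosmological term the vacuum equation says that $Rc$ is a constant multiple of $g$, which is the Einstein condition of the mathematical definition.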

Other than these special cases and the obvious formal analogy between
(7.11) and (7.10), there is no direct connection between the physicists'
version of the Einstein equation and the mathematicians' version.
Mathematically, Einstein metrics are interesting not because of their
relation to physics, but because of their potential applications to
uniformization in higher dimensions.

Another approach to generalizing the uniformization theorem to higher
dimensions is to search for metrics of constant scalar curvature. These are
also critical points of the total scalar curvature functional, but only with
respect to variations of the metric within a given conformal equivalence
class. Thus it makes sense to ask whether, given a metric $g$ on a manifold
$M$, there exists a metric $\tilde g$ conformal to $g$ that has constant
scalar curvature. This is called the Yamabe problem, because it was first
posed in 1960 by Hidehiko Yamabe, who claimed to have proved that the answer
is always "yes" when $M$ is compact. Yamabe's proof was later found to be in
error, and it was two dozen years before the proof was finally completed by
Richard Schoen; see [LP87] for an expository account of Schoen's solution.
When $M$ is noncompact, the issues are much subtler, and much current
research is focused on determining exactly which conformal classes contain
metrics of constant scalar curvature.


Problems

7-1. Let $M$ be a Riemannian manifold, and $(x^i)$ any local coordinates on
$M$.

(a) Compute the components of the Riemann curvature tensor in
terms of the Christoffel symbols in coordinates.

(b) Now suppose $(x^i)$ are normal coordinates centered at $p \in M$.
Show that the following holds at $p$:

$$R_{ijkl} = \tfrac{1}{2}\bigl(\partial_j\partial_l g_{ik} + \partial_i\partial_k g_{jl} - \partial_i\partial_l g_{jk} - \partial_j\partial_k g_{il}\bigr).$$

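7-2. Let $\nabla$ be the Riemannian connection on a Riemannian manifold
$(M,g)$, and let $(\omega_i{}^j)$ be its connection 1-forms with respect to a
local frame $\{E_i\}$ (Problem 4-5). Define a matrix of 2-forms
$\Omega_i{}^j$, called the curvature 2-forms, by

$$\Omega_i{}^j = \tfrac{1}{2} R_{kli}{}^{j}\, \varphi^k \wedge \varphi^l,$$

where $(\varphi^k)$ is the coframe dual to $\{E_i\}$. Show that they satisfy
Cartan's second structural equation:

$$\Omega_i{}^j = d\omega_i{}^j - \omega_i{}^k \wedge \omega_k{}^j.$$

[Hint: Expand $R(E_k, E_l)E_i$ in terms of $\nabla$ and $\omega_i{}^j$.]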

7-3. If $\eta$ is a 1-form, let

$$\eta_{i;jk}\, dx^i \otimes dx^j \otimes dx^k$$

be the local expression for $\nabla^2\eta$. Prove the Ricci identity

$$\eta_{i;jk} - \eta_{i;kj} = R_{jki}{}^{l}\, \eta_l.$$

[Hint: Instead of expanding out the components of $\eta_{i;jk}$ in terms of
the Christoffel symbols, either try to find an expression for
$\nabla^2\eta$ similar to (4.8) and use the definition of the curvature
endomorphism, or use the result of Problem 7-2.]

7-4. Let $\nabla$ be any linear connection on a manifold $M$. We can define
the curvature endomorphism of $\nabla$ by the same formula as in the
Riemannian case; $\nabla$ is said to be flat if $R(X,Y)Z = 0$. Prove that
the following are equivalent:

(a) $\nabla$ is flat.

(b) Near every point $p \in M$, there exists a parallel local frame.

(c) For all $p, q \in M$, parallel translation along a curve segment
$\gamma$ from $p$ to $q$ depends only on the homotopy class of $\gamma$.

(d) Parallel translation around any sufficiently small closed curve is
the identity; that is, for any $p \in M$, there exists a neighborhood
$U$ of $p$ such that if $\gamma\colon [a,b] \to U$ is a smooth curve in $U$
starting and ending at $p$, then $P_{ab}\colon T_pM \to T_pM$ is the
identity map.

7-5. Let $G$ be a Lie group with a bi-invariant metric $g$ (see Problems 3-12
and 5-11). Show that the Riemannian curvature endomorphism of $g$
can be computed as follows:

$$R(X,Y)Z = \tfrac{1}{4}[Z,[X,Y]]$$

whenever $X, Y, Z$ are left-invariant vector fields on $G$.


Riemannian Submanifolds


This chapter has a dual purpose: first to develop the basic concepts of 
the theory of Riemannian submanifolds, and then to use these concepts to 
derive a quantitative interpretation of the curvature tensor. 

After introducing some basic definitions and terminology concerning
submanifolds, we define a tensor field called the second fundamental form,
which measures the way a submanifold curves within the ambient manifold.
We then prove the fundamental relationships between the intrinsic
and extrinsic geometries of a submanifold: the Gauss formula relates the
Riemannian connection on the submanifold to that of the ambient manifold,
and the Gauss equation relates their curvatures. We show how the
second fundamental form can be interpreted as a measure of the extrinsic
curvature of submanifold geodesics.

Using these tools, we focus on the special case of hypersurfaces in
$\mathbb{R}^{n+1}$, and show how the second fundamental form is related to
the principal curvatures and Gaussian curvature. We prove Gauss's Theorema
Egregium, which shows that the Gaussian curvature of a surface in
$\mathbb{R}^3$ can be computed from the intrinsic curvature tensor.

In the last section, we introduce the promised quantitative geometric
interpretation of the curvature tensor. It allows us to compute sectional
curvatures, which are just the Gaussian curvatures of 2-dimensional
submanifolds swept out by geodesics tangent to 2-planes in the tangent space.
Finally, we compute the sectional curvatures of our model Riemannian
manifolds: Euclidean spaces, spheres, and hyperbolic spaces.

Caution must be exercised when applying the methods of this chapter
to pseudo-Riemannian manifolds, because the restriction of a
pseudo-Riemannian metric to a submanifold might not be nondegenerate. (See
Problem 8-9, though.)

Riemannian Submanifolds and the Second 
Fundamental Form 

Definitions 

Suppose $(\tilde M, \tilde g)$ is a Riemannian manifold of dimension $m$,
$M$ is a manifold of dimension $n$, and $\iota\colon M \to \tilde M$ is an
immersion. If $M$ is given the induced Riemannian metric
$g := \iota^*\tilde g$, then $\iota$ is said to be an isometric immersion (or
an isometric embedding if $\iota$ happens to be an embedding). If in addition
$\iota$ is injective, so that $M$ is an (immersed or embedded) submanifold of
$\tilde M$, then $M$ is said to be a Riemannian submanifold of $\tilde M$. In
all of these situations, $\tilde M$ is called the ambient manifold.

All the considerations of this chapter apply to any isometric immersion.
Since our computations are all local, and since any immersion is locally an
embedding, we may assume $M$ is an embedded Riemannian submanifold,
possibly after shrinking $M$ a bit. We usually proceed under such an
assumption without further comment. Covariant derivatives and curvatures
with respect to $(M, g)$ are written in the normal way, while those with
respect to $(\tilde M, \tilde g)$ are written with tildes. We can
unambiguously use the inner-product notation $\langle X, Y\rangle$ to refer
to either metric $g$ or $\tilde g$, since $g$ is just the restriction of
$\tilde g$ to $TM$.

It is easy to see that the set

$$T\tilde M|_M := \coprod_{p \in M} T_p\tilde M$$

is a smooth vector bundle over $M$, with local trivializations provided, for
example, by the vector fields $(\partial_1, \dots, \partial_m)$ in any
coordinate chart on $\tilde M$. We call it the ambient tangent bundle over
$M$. Any smooth vector field on $\tilde M$ clearly restricts to a smooth
section of $T\tilde M|_M$. Conversely, any smooth section $X$ of
$T\tilde M|_M$ can be extended to a smooth section of $T\tilde M$ by the
same method as in the proof of Exercise 2.3. When there is no risk of
confusion, we use the same letter to denote both a vector field or function
on $M$ and its extension to $\tilde M$.

At each $p \in M$, the ambient tangent space $T_p\tilde M$ splits as an
orthogonal direct sum $T_p\tilde M = T_pM \oplus N_pM$, where
$N_pM := (T_pM)^\perp$ is the normal space at $p$ with respect to the inner
product $\tilde g$ on $T_p\tilde M$ (Figure 8.1). The set

$$NM := \coprod_{p \in M} N_pM$$

is called the normal bundle of $M$. To see that it is a smooth vector bundle
over $M$, we use the result of Problem 3-1: Given any point $p \in M$, there
is a neighborhood $\tilde U$ of $p$ in $\tilde M$ and a smooth orthonormal
frame $(E_1, \dots, E_m)$ on $\tilde U$, called an adapted orthonormal frame,
such that the restrictions of $(E_1, \dots, E_n)$ to $M$ form a local
orthonormal frame for $TM$. Given any such frame, the last $m - n$ vectors
$(E_{n+1}|_p, \dots, E_m|_p)$ form a basis for $N_pM$ at each $p \in M$, and
we can use the components of a normal vector with respect to this basis to
construct a local trivialization of $NM$. It is straightforward to check that
the transition functions are smooth, so $NM$ is a vector bundle by Lemma 2.2.
The notations $\mathcal{T}(\tilde M|_M)$ and $\mathcal{N}(M)$ denote the
spaces of smooth sections of $T\tilde M|_M$ and $NM$, respectively.

FIGURE 8.1. The normal space at p.

Projecting orthogonally at each point $p \in M$ onto the subspaces $T_pM$
and $N_pM$ gives maps called the tangential and normal projections

$$\pi^\top\colon T\tilde M|_M \to TM,$$
$$\pi^\perp\colon T\tilde M|_M \to NM.$$

In terms of an adapted orthonormal frame, these are just the usual
projections onto $\operatorname{span}(E_1, \dots, E_n)$ and
$\operatorname{span}(E_{n+1}, \dots, E_m)$ respectively, so both
projections map smooth sections to smooth sections. If $X$ is a section of
$T\tilde M|_M$, we often use the shorthand notations $X^\top := \pi^\top X$
and $X^\perp := \pi^\perp X$ for its tangential and normal projections.


The Second Fundamental Form 

Our first main task is to compare the Riemannian connection of $M$ with
that of $\tilde M$. The starting point for doing so is the orthogonal
decomposition of sections of $T\tilde M|_M$ into tangential and orthogonal
components as above. If $X, Y$ are vector fields in $\mathcal{T}(M)$, we can
extend them to vector fields on $\tilde M$, apply the ambient covariant
derivative operator $\tilde\nabla$, and then decompose at points of $M$ to
get

$$\tilde\nabla_X Y = (\tilde\nabla_X Y)^\top + (\tilde\nabla_X Y)^\perp. \tag{8.1}$$

We would like to interpret the two terms on the right-hand side of this
decomposition.

FIGURE 8.2. The second fundamental form.

Let's focus first on the normal component. We define the second
fundamental form of $M$ to be the map $II$ (read "two") from
$\mathcal{T}(M) \times \mathcal{T}(M)$ to $\mathcal{N}(M)$ given by

$$II(X,Y) := (\tilde\nabla_X Y)^\perp,$$

where $X$ and $Y$ are extended arbitrarily to $\tilde M$ (Figure 8.2). Since
$\pi^\perp$ maps smooth sections to smooth sections, $II(X,Y)$ is a smooth
section of $NM$.

The term “first fundamental form,” by the way, was used classically 
to refer to the induced metric g on M. Although that usage has mostly 
been replaced by more descriptive terminology, we seem unfortunately to 
be stuck with the name “second fundamental form.” The word “form” in 
both cases refers to bilinear form, not differential form. 

Lemma 8.1. The second fundamental form is

(a) independent of the extensions of $X$ and $Y$;

(b) bilinear over $C^\infty(M)$; and

(c) symmetric in $X$ and $Y$.

Proof. First we show that the symmetry of $II$ follows from the symmetry
of the connection $\tilde\nabla$. Let $X$ and $Y$ be extended arbitrarily to
$\tilde M$. Then

$$II(X,Y) - II(Y,X) = (\tilde\nabla_X Y - \tilde\nabla_Y X)^\perp = [X,Y]^\perp.$$









Since $X$ and $Y$ are tangent to $M$ at all points of $M$, so is their Lie
bracket. (This follows easily from Exercise 2.3.) Therefore
$[X,Y]^\perp = 0$, so $II$ is symmetric.

Because $\tilde\nabla_X Y|_p$ depends only on $X_p$, it is clear that
$II(X,Y)$ is independent of the extension chosen for $X$, and that $II(X,Y)$
is linear over $C^\infty(M)$ in $X$. By symmetry, the same is true for
$Y$. □

We have not yet identified the tangential term in the decomposition of
$\tilde\nabla_X Y$. The following theorem shows that it is nothing other than
$\nabla_X Y$, the covariant derivative with respect to the Riemannian
connection of $g$. Therefore, we can interpret the second fundamental form as
a measure of the difference between the intrinsic Riemannian connection on
$M$ and the ambient Riemannian connection on $\tilde M$.

Theorem 8.2. (The Gauss Formula) If $X, Y \in \mathcal{T}(M)$ are extended
arbitrarily to vector fields on $\tilde M$, the following formula holds along
$M$:

$$\tilde\nabla_X Y = \nabla_X Y + II(X,Y).$$

Proof. Because of the decomposition (8.1) and the definition of the second
fundamental form, it suffices to show that
$(\tilde\nabla_X Y)^\top = \nabla_X Y$ at all points of $M$.

Define a map
$\nabla^\top\colon \mathcal{T}(M) \times \mathcal{T}(M) \to \mathcal{T}(M)$ by

$$\nabla^\top_X Y := (\tilde\nabla_X Y)^\top,$$

where $X, Y$ are extended arbitrarily to $\tilde M$. We examined a special
case of this construction, in which $\tilde g$ is the Euclidean metric, in
Lemma 5.1. It follows exactly as in the proof of that lemma that
$\nabla^\top$ is a connection on $M$. Once we show that it is symmetric and
compatible with $g$, the uniqueness of the Riemannian connection on $M$ shows
that $\nabla^\top = \nabla$.

To see that $\nabla^\top$ is symmetric, we use the symmetry of
$\tilde\nabla$ and the fact that $[X,Y]$ is tangent to $M$:

$$\nabla^\top_X Y - \nabla^\top_Y X = (\tilde\nabla_X Y - \tilde\nabla_Y X)^\top = [X,Y]^\top = [X,Y].$$

To prove compatibility with $g$, let $X, Y, Z \in \mathcal{T}(M)$ be extended
arbitrarily to $\tilde M$. Using compatibility of $\tilde\nabla$ with
$\tilde g$, and evaluating at points of $M$,

$$\begin{aligned}
X\langle Y, Z\rangle &= \langle \tilde\nabla_X Y, Z\rangle + \langle Y, \tilde\nabla_X Z\rangle \\
&= \langle (\tilde\nabla_X Y)^\top, Z\rangle + \langle Y, (\tilde\nabla_X Z)^\top\rangle \\
&= \langle \nabla^\top_X Y, Z\rangle + \langle Y, \nabla^\top_X Z\rangle.
\end{aligned}$$

Therefore $\nabla^\top$ is compatible with $g$, so $\nabla^\top = \nabla$. □

Although the second fundamental form is defined in terms of covariant
derivatives of vector fields tangent to $M$, it can also be used to evaluate
covariant derivatives of normal vector fields, as the following lemma shows.

Lemma 8.3. (The Weingarten Equation) Suppose $X, Y \in \mathcal{T}(M)$ and
$N \in \mathcal{N}(M)$. When $X, Y, N$ are extended arbitrarily to
$\tilde M$, the following equation holds at points of $M$:

$$\langle \tilde\nabla_X N, Y\rangle = -\langle N, II(X,Y)\rangle.$$

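Proof. Since $\langle N, Y\rangle$ vanishes identically along $M$ and $X$ is
tangent to $M$, the following holds along $M$:

$$\begin{aligned}
0 &= X\langle N, Y\rangle \\
&= \langle \tilde\nabla_X N, Y\rangle + \langle N, \tilde\nabla_X Y\rangle \\
&= \langle \tilde\nabla_X N, Y\rangle + \langle N, \nabla_X Y + II(X,Y)\rangle \\
&= \langle \tilde\nabla_X N, Y\rangle + \langle N, II(X,Y)\rangle.
\end{aligned}$$

□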
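
As an added illustration of how the Weingarten equation can be used to compute $II$ (a sketch under stated assumptions, not part of the original text), let $M$ be the sphere of radius $R$ centered at the origin in $\mathbb{R}^{n+1}$ with the Euclidean metric as ambient metric, and let $N(x) = x/R$, which restricts to the outward unit normal along $M$. The extension $N(x) = x/R$ to all of $\mathbb{R}^{n+1}$ is a choice made here for convenience; the lemma allows any extension.

```latex
\begin{align*}
\langle \tilde\nabla_X N, Y\rangle &= \tfrac{1}{R}\,\langle X, Y\rangle
   && \text{(since } \tilde\nabla_X N = X/R \text{ for } N(x) = x/R\text{)} \\
-\langle N, II(X,Y)\rangle &= \tfrac{1}{R}\,\langle X, Y\rangle
   && \text{(Weingarten equation)} \\
II(X,Y) &= -\tfrac{1}{R}\,\langle X, Y\rangle\, N
   && \text{(}NM \text{ has rank 1 and } |N| = 1\text{)}.
\end{align*}
```

The sign depends on the choice of normal: with the inward normal $-N$ the same vector $II(X,Y)$ is written $+\tfrac{1}{R}\langle X,Y\rangle(-N)$.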

In addition to describing the difference between the intrinsic and extrinsic
connections, the second fundamental form plays an even more important
role in describing the difference between the curvature tensors of $M$ and
$\tilde M$. The explicit formula, also due to Gauss, is given in the
following theorem.

Theorem 8.4. (The Gauss Equation) For any $X, Y, Z, W \in T_pM$, the
following equation holds:

$$\begin{aligned}
\widetilde{Rm}(X,Y,Z,W) = {} & Rm(X,Y,Z,W) \\
& - \langle II(X,W), II(Y,Z)\rangle + \langle II(X,Z), II(Y,W)\rangle.
\end{aligned}$$

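Proof. Let $X, Y, Z, W$ be extended arbitrarily to vector fields on $M$, and
then to vector fields on $\tilde M$ that are tangent to $M$ at points of $M$.
Along $M$, the Gauss formula gives

$$\begin{aligned}
\widetilde{Rm}(X,Y,Z,W) &= \langle \tilde\nabla_X\tilde\nabla_Y Z - \tilde\nabla_Y\tilde\nabla_X Z - \tilde\nabla_{[X,Y]}Z, W\rangle \\
&= \langle \tilde\nabla_X(\nabla_Y Z + II(Y,Z)) - \tilde\nabla_Y(\nabla_X Z + II(X,Z)) \\
&\qquad - (\nabla_{[X,Y]}Z + II([X,Y],Z)), W\rangle.
\end{aligned}$$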


Since the second fundamental form takes its values in the normal bundle
and $W$ is tangent to $M$, the last $II$ term is zero. Apply the Weingarten
equation to the other two terms involving $II$ (with $II(Y,Z)$ or $II(X,Z)$
playing the role of $N$) to get

$$\begin{aligned}
\widetilde{Rm}(X,Y,Z,W) = {} & \langle \tilde\nabla_X\nabla_Y Z, W\rangle - \langle II(Y,Z), II(X,W)\rangle \\
& - \langle \tilde\nabla_Y\nabla_X Z, W\rangle + \langle II(X,Z), II(Y,W)\rangle \\
& - \langle \nabla_{[X,Y]}Z, W\rangle.
\end{aligned}$$

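Decomposing each term involving $\tilde\nabla$ into its tangential and normal
components, we see that only the tangential component survives. Using the
Gauss formula allows each to be rewritten in terms of $\nabla$, giving

$$\begin{aligned}
\widetilde{Rm}(X,Y,Z,W) = {} & \langle \nabla_X\nabla_Y Z, W\rangle - \langle \nabla_Y\nabla_X Z, W\rangle - \langle \nabla_{[X,Y]}Z, W\rangle \\
& - \langle II(Y,Z), II(X,W)\rangle + \langle II(X,Z), II(Y,W)\rangle \\
= {} & \langle R(X,Y)Z, W\rangle \\
& - \langle II(X,W), II(Y,Z)\rangle + \langle II(X,Z), II(Y,W)\rangle.
\end{aligned}$$

This proves the theorem. □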
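
As an added illustration of the Gauss equation (a sketch under stated assumptions, not part of the original text), take $M$ to be the sphere of radius $R$ in Euclidean space $\mathbb{R}^{n+1}$. Two inputs are assumed: the ambient curvature tensor of Euclidean space vanishes, and the second fundamental form of the sphere is $II(X,Y) = -\tfrac{1}{R}\langle X,Y\rangle N$ with $N$ the outward unit normal, which can be obtained from the Weingarten equation applied to $N(x) = x/R$.

```latex
\begin{align*}
0 &= Rm(X,Y,Z,W)
     - \langle II(X,W), II(Y,Z)\rangle
     + \langle II(X,Z), II(Y,W)\rangle, \\
Rm(X,Y,Z,W)
  &= \tfrac{1}{R^2}\bigl(\langle X,W\rangle\langle Y,Z\rangle
     - \langle X,Z\rangle\langle Y,W\rangle\bigr).
\end{align*}
% The second line uses \langle N, N \rangle = 1.
```

In particular, for orthonormal $X, Y$ this gives $Rm(X,Y,Y,X) = 1/R^2$, so the intrinsic curvature of the sphere is recovered entirely from how it sits in the flat ambient space.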

Curvature of Curves 

By studying the curvature of curves in Riemannian manifolds, we can give
a more geometric interpretation to the second fundamental form. If
$\gamma\colon I \to M$ is a unit speed curve in a Riemannian manifold, we
define the (geodesic) curvature of $\gamma$ as the function
$\kappa\colon I \to \mathbb{R}$ given by

$$\kappa(t) := |D_t\dot\gamma(t)|.$$

If $\gamma$ is an arbitrary regular curve (not necessarily unit speed), we
first reparametrize it by arc length to get a unit speed curve, and then
define the curvature by this formula as a function of arc length. Clearly
$\kappa$ vanishes identically if and only if $\gamma$ is a geodesic, so it
may be thought of as a quantitative measure of how far $\gamma$ deviates from
being a geodesic. If $M = \mathbb{R}^n$ with the Euclidean metric, this is
the same as the classical notion of curvature introduced in advanced calculus
courses.
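
As a simple added example of this definition (not part of the original text), consider the circle of radius $R$ in the Euclidean plane, parametrized by arc length. In Euclidean space $D_t\dot\gamma$ is the ordinary second derivative $\ddot\gamma$.

```latex
\begin{align*}
\gamma(t) &= \bigl(R\cos(t/R),\; R\sin(t/R)\bigr), \\
\dot\gamma(t) &= \bigl(-\sin(t/R),\; \cos(t/R)\bigr),
   \qquad |\dot\gamma(t)| = 1, \\
D_t\dot\gamma(t) = \ddot\gamma(t)
   &= -\tfrac{1}{R}\bigl(\cos(t/R),\; \sin(t/R)\bigr),
   \qquad \kappa(t) = |D_t\dot\gamma(t)| = \tfrac{1}{R}.
\end{align*}
```

So the geodesic curvature of a Euclidean circle is the reciprocal of its radius, and it tends to zero as the circle flattens toward a line.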

Exercise 8.1. If $\gamma$ is a unit speed curve in $\mathbb{R}^n$ and
$\kappa(t_0) \ne 0$, show that there is a unique unit speed parametrized
circle $c\colon \mathbb{R} \to \mathbb{R}^n$, called the osculating circle at
$\gamma(t_0)$, with the property that $c$ and $\gamma$ have the same
position, velocity, and acceleration at $t = t_0$. Show that
$\kappa(t_0) = 1/R$, where $R$ is the radius of the osculating circle.

Exercise 8.2. Suppose $\gamma\colon I \to M$ is a regular curve in a
Riemannian manifold, but not necessarily unit speed. Show that the curvature
of $\gamma$ at $\gamma(t)$ is

$$\kappa(t) = \frac{\sqrt{|D_t\dot\gamma(t)|^2\,|\dot\gamma(t)|^2 - \langle D_t\dot\gamma(t), \dot\gamma(t)\rangle^2}}{|\dot\gamma(t)|^3}.$$







If $M \subset \tilde M$ is a Riemannian submanifold and $\gamma$ is a curve
in $M$, $\gamma$ has two distinct geodesic curvatures: its "intrinsic"
curvature $\kappa$ as a curve in $M$, and its "extrinsic" curvature
$\tilde\kappa$ as a curve in $\tilde M$. The second fundamental form
can be used to compute the relationship between the two. First we need
another version of the Gauss formula, better suited to covariant derivatives
along curves.

Lemma 8.5. (The Gauss Formula Along a Curve) Let $M$ be a Riemannian
submanifold of $\tilde M$, and $\gamma$ a curve in $M$. For any vector field
$V$ tangent to $M$ along $\gamma$,

$$\tilde D_t V = D_t V + II(\dot\gamma, V).$$

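Proof. In terms of an adapted orthonormal frame, $V$ can be written
$V(t) = V^i(t) E_i$, where the sum is only over $i = 1, \dots, n$. Applying
the product rule and the Gauss formula, we get

$$\begin{aligned}
\tilde D_t V &= \dot V^i E_i + V^i \tilde\nabla_{\dot\gamma} E_i \\
&= \dot V^i E_i + V^i \nabla_{\dot\gamma} E_i + V^i\, II(\dot\gamma, E_i) \\
&= D_t V + II(\dot\gamma, V).
\end{aligned}$$

□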

Applying this lemma to the special case in which F = 7 , we obtain the 
following formula for the acceleration of any curve in M: 

bti = Ay + 77 ( 7 , 7 ). 

If 7 is a geodesic in M, this formula simplifies to 


7 ?t 7 = 77(7,7). 

Thus we obtain the following concrete geometric interpretation of the 
second fundamental form: For any vector V G TpM, IIfV,V) is the g- 
acceleration at p of the g-geodesic yy. If V is a unit vector, \IIfV,V)\ is 
the g-curvature ofjv at p. Note that the second fundamental form is sym¬ 
metric and bilinear, so it is completely determined by its values of the form 
77(1/, y) as y ranges over unit vectors tangent to M. 

In the special case in which $\tilde M$ is $\mathbf{R}^m$ with the Euclidean metric, we
can make this geometric interpretation even more concrete: $II(V,V)$ is the
ordinary Euclidean acceleration of the geodesic in $M$ with initial velocity
$V$.
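
As a quick illustration of this interpretation (a worked example added here, not part of the original argument), take $M$ to be the sphere of radius $R$ centered at the origin in $\mathbf{R}^m$, let $p \in M$, and let $V \in T_pM$ be a unit vector, so that $|p| = R$ and $\langle p, V\rangle = 0$. Taking for granted that the geodesics of the round sphere are great circles, the geodesic with initial velocity $V$ and its Euclidean acceleration are as follows; the symbol $\nu_p = p/R$ for the outward unit normal is notation introduced only for this example.

```latex
\begin{align*}
\gamma_V(t) &= \cos(t/R)\,p + R\sin(t/R)\,V, \\
\ddot\gamma_V(t) &= -\frac{1}{R^2}\,\gamma_V(t), \\
II(V,V) &= \ddot\gamma_V(0) = -\frac{1}{R^2}\,p = -\frac{1}{R}\,\nu_p .
\end{align*}
```

In particular $|II(V,V)| = 1/R$ for every unit vector $V$, which is the Euclidean curvature of a circle of radius $R$.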


Exercise 8.3. Suppose $M \subset \mathbf{R}^m$ is a submanifold with the induced Riemannian
metric, $\gamma$ is a curve in $M$, and $V$ is a vector field tangent to $M$
along $\gamma$.

(a) Show that $D_tV(t)$ is the orthogonal projection onto $TM$ of the ordinary
Euclidean derivative $\dot V(t)$.

(b) Show that $\gamma$ is a geodesic in $M$ if and only if its Euclidean acceleration
$\ddot\gamma$ is everywhere normal to $M$.

(c) Use this to give another proof that the geodesics on the $n$-sphere are
the great circles.

We say a Riemannian submanifold $M \subset \tilde M$ is totally geodesic if for every
$V \in TM$, the $\tilde g$-geodesic $\gamma_V$ lies entirely in $M$.

Exercise 8.4. Show that the following are equivalent for a Riemannian
submanifold $M \subset \tilde M$:

(a) $M$ is totally geodesic.

(b) Every $g$-geodesic in $M$ is also a $\tilde g$-geodesic in $\tilde M$.

(c) The second fundamental form of M vanishes identically. 

Hypersurfaces in Euclidean Space 

Now we specialize the preceding considerations to the case in which $M$
is a hypersurface (i.e., a submanifold of codimension 1) in $\mathbf{R}^{n+1}$ with the
induced Riemannian metric. We denote the Euclidean metric as usual by
$\bar g$. Covariant derivatives and curvatures associated with $\bar g$ will be indicated
by a bar.

In this situation, at each point of $M$ there are exactly two unit normal
vectors. If $M$ is orientable (which we may assume by passing to a subset of
$M$), we can use an orientation to pick out a unique normal. The resulting
vector field $N$ is a smooth section of $NM$, as can be seen easily by noting
that in terms of any local adapted orthonormal frame $(E_1, \dots, E_{n+1})$, it
must be $N = \pm E_{n+1}$. We will address as we go along the question of how
various quantities depend on the choice of normal vector field.


The Scalar Second Fundamental Form and the Shape Operator 

Given a unit normal vector field $N$, we can replace the vector-valued second
fundamental form $II$ by a somewhat simpler scalar-valued form. The scalar
second fundamental form $h$ is the symmetric 2-tensor on $M$ defined by

\[ h(X,Y) = \langle II(X,Y), N\rangle. \]

Since $N$ is a unit vector spanning $NM$ at each point, this is equivalent to

\[ II(X,Y) = h(X,Y)N. \]

Note that the sign of $h$ depends on which unit normal is chosen, but $h$ is
otherwise independent of choices.

Raising one index of $h$, we get a tensor field $s \in \mathcal{T}^1_1(M)$, which can also
be thought of as a field of endomorphisms of $TM$ by Lemma 2.4, called the
shape operator of $M$. It is characterized by

\[ \langle X, sY\rangle = h(X,Y) \quad \text{for all } X, Y \in \mathcal{T}(M). \]

Because $h$ is symmetric, $s$ is a selfadjoint endomorphism of $TM$, that is,

\[ \langle sX, Y\rangle = \langle X, sY\rangle \quad \text{for all } X, Y \in \mathcal{T}(M). \]

As with $h$, the sign of $s$ depends on the choice of $N$.

In terms of the tensor fields $h$ and $s$, the formulas of the last section can
be rewritten somewhat more simply. First, we have the Gauss formula for
Euclidean hypersurfaces:

\[ \bar\nabla_X Y = \nabla_X Y + h(X,Y)N. \]

Second, the Weingarten equation can be written

\[ \langle \bar\nabla_X N, Y\rangle = -h(X,Y) = -\langle sX, Y\rangle. \tag{8.2} \]

Since $\langle \bar\nabla_X N, N\rangle = \tfrac12 \bar\nabla_X |N|^2 = 0$, it follows that $\bar\nabla_X N$ is tangent to $M$, so
(8.2) is equivalent to the Weingarten equation for Euclidean hypersurfaces:

\[ \bar\nabla_X N = -sX. \tag{8.3} \]

Finally, since $\overline{Rm} = 0$ on $\mathbf{R}^{n+1}$, the Gauss equation for Euclidean
hypersurfaces is

\[ Rm(X,Y,Z,W) = h(X,W)h(Y,Z) - h(X,Z)h(Y,W). \tag{8.4} \]

If $\gamma$ is a curve in $M$, its Euclidean acceleration vector can be decomposed
into tangential and normal components in the usual way. By the Gauss
formula, they are

\[ \ddot\gamma = \bar D_t \dot\gamma = D_t \dot\gamma + h(\dot\gamma, \dot\gamma)N. \]

If $\gamma$ is a unit speed geodesic in $M$, its intrinsic acceleration $D_t\dot\gamma$ is zero. Its
Euclidean acceleration therefore has only a normal component,

\[ \ddot\gamma = h(\dot\gamma, \dot\gamma)N, \]

and its Euclidean curvature is

\[ \bar\kappa = |\ddot\gamma| = |h(\dot\gamma, \dot\gamma)|. \]

Therefore $h(\dot\gamma, \dot\gamma) = \pm\bar\kappa$, with a positive sign if and only if $\ddot\gamma$ points in the
same direction as $N$. This shows that the scalar second fundamental form
has the following geometric interpretation: For any unit vector $V \in T_pM$,
$h(V,V)$ is the signed Euclidean curvature at $p$ of the $M$-geodesic $\gamma_V$ (Figure
8.3), with a positive sign if $\gamma_V$ is curving toward $N$ at $p$ and a negative sign
if it is curving away from $N$.

FIGURE 8.3. Geometric interpretation of $h(V,V)$.
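
To see the sign convention at work, here is a worked example that we add as an illustration. Let $M \subset \mathbf{R}^3$ be the cylinder $x^2 + y^2 = R^2$ with outward unit normal $N = (x, y, 0)/R$ (the choice of normal and the angle $\alpha$ are assumptions of the example). At $p = (R, 0, 0)$, consider the unit tangent vector $V = (0, \cos\alpha, \sin\alpha)$. The helix below is unit speed with initial velocity $V$, and its Euclidean acceleration is everywhere normal to $M$, so it is the geodesic $\gamma_V$ by Exercise 8.3(b).

```latex
\begin{align*}
\gamma_V(t) &= \Bigl( R\cos\bigl(\tfrac{t\cos\alpha}{R}\bigr),\;
                     R\sin\bigl(\tfrac{t\cos\alpha}{R}\bigr),\;
                     t\sin\alpha \Bigr), \\
\ddot\gamma_V(0) &= -\frac{\cos^2\alpha}{R}\,(1, 0, 0) = -\frac{\cos^2\alpha}{R}\,N_p, \\
h(V,V) &= -\frac{\cos^2\alpha}{R}.
\end{align*}
```

The sign is negative because the helix curves away from the outward normal. The value ranges from $-1/R$ in the direction of the circle ($\alpha = 0$) to $0$ in the direction of the straight ruling ($\alpha = \pi/2$).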


Principal Curvatures 

At any point $p \in M$, we have seen that the shape operator $s$ is a selfadjoint
linear transformation on the tangent space $T_pM$. From elementary linear
algebra, any such operator has real eigenvalues $\kappa_1, \dots, \kappa_n$, and there is an
orthonormal basis $(E_1, \dots, E_n)$ for $T_pM$ consisting of $s$-eigenvectors, so
that $sE_i = \kappa_i E_i$ (no summation). In this basis both $h$ and $s$ are diagonal,
and $h$ has the expression

\[ h(X,Y) = \kappa_1 X^1 Y^1 + \dots + \kappa_n X^n Y^n. \]

The eigenvalues of $s$ are called the principal curvatures of $M$ at $p$, and
the corresponding eigenspaces are called the principal directions. They are
independent of choice of basis, but the principal curvatures change sign
if we change the normal vector. The principal curvatures give a concise
description of the local shape of the embedded surface $M$, in a sense made
precise by the following exercise.

Exercise 8.5. Suppose $M \subset \mathbf{R}^{n+1}$ is a hypersurface with the induced
metric. Let $p \in M$, and let $\kappa_1, \dots, \kappa_n$ denote the principal curvatures of $M$
at $p$ with respect to some choice of unit normal.

1. Show that $M$ can be approximated locally by the quadratic polynomial
in the following sense: There are Euclidean coordinates $(x, y) =
(x^1, \dots, x^n, y)$ centered at $p$ such that $M$ is described locally by an
equation of the form $y = f(x)$, where the second-order Taylor series of
$f$ at the origin is

\[ f(x) = \tfrac12\bigl(\kappa_1 (x^1)^2 + \dots + \kappa_n (x^n)^2\bigr) + O(|x|^3). \]

2. If $n = 2$, show that $\kappa_1$ and $\kappa_2$ are equal to the minimum and maximum
signed Euclidean curvatures of $M$-geodesics passing through $p$, and also
to the minimum and maximum signed Euclidean curvatures of plane
curves obtained by intersecting $M$ with planes orthogonal to $T_pM$.

Gaussian and Mean Curvatures

There are two combinations of the principal curvatures that play particularly
important roles for Euclidean hypersurfaces. The Gaussian curvature
is defined as $K = \det s$, and the mean curvature as $H = (1/n) \operatorname{tr} s =
(1/n) \operatorname{tr}_g h$. Since the determinant and trace of a linear map are
basis-independent, these are well defined. In terms of the principal curvatures,
they are

\[ K = \kappa_1 \kappa_2 \cdots \kappa_n; \qquad H = \frac{1}{n}(\kappa_1 + \dots + \kappa_n). \]

We know from Proposition 5.13 that the geodesics on the round 2-sphere
$S^2_R$ of radius $R$ are exactly the great circles of radius $R$. Since these have
Euclidean curvature $1/R$, it is immediate that the principal curvatures at
any point are $\kappa_1 = \kappa_2 = \pm 1/R$. Therefore $S^2_R$ has constant mean curvature
$H = \pm 1/R$ and constant Gaussian curvature $K = 1/R^2$.

For other surfaces in $\mathbf{R}^3$, the Gaussian and mean curvatures are usually
easiest to compute in terms of parametrizations. Let $M \subset \mathbf{R}^3$ be a smooth
surface, and let $X\colon U \to \mathbf{R}^3$ be a local parametrization of $M$. The coordinates
$(u^1, u^2)$ on $U \subset \mathbf{R}^2$ thus give local coordinates for $M$. The coordinate
vector fields $\partial_i = \partial/\partial u^i$ push forward to vectors $X_*\partial_i = \partial_i X$ (thinking of
$X(u) = (X^1(u), X^2(u), X^3(u))$ as a vector-valued function of $u$) in $\mathbf{R}^3$ that
are tangent to $M$. Their ordinary cross product is therefore normal to $M$,
so one choice of unit normal is

\[ N = \frac{\partial_1 X \times \partial_2 X}{|\partial_1 X \times \partial_2 X|}. \]

We can then compute the shape operator using the Weingarten equation
for Euclidean hypersurfaces (8.3):

\[ s\partial_i = -\bar\nabla_{\partial_i} N = -\partial_i N, \]

where again we think of $N$ as a vector-valued function of $u$, and use the fact
that the directional derivative $\bar\nabla_{\partial_i}$ can be evaluated by differentiating along
the $u^i$-coordinate curve in $M$. After expressing $s\partial_1$ and $s\partial_2$ in terms of the
basis vectors $(\partial_1 X, \partial_2 X)$, it is straightforward to compute the principal
curvatures. Problems 8-1, 8-2, and 8-3 will give you practice in carrying
out these computations for surfaces presented in various ways.
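
The recipe above can also be tried numerically. The following is a minimal sketch, not a method from the text: it replaces the exact partial derivatives by central finite differences (an assumption that limits accuracy to a few digits), and the names `gaussian_and_mean_curvature`, `combine`, and `sphere` are hypothetical helpers invented for this illustration. It uses the identity $h_{ij} = \langle \partial_i\partial_j X, N\rangle = -\langle \partial_i N, \partial_j X\rangle$, so that the matrix of $s$ in the basis $(\partial_1 X, \partial_2 X)$ is $g^{-1}h$.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def combine(coeffs, points):
    """Linear combination sum_k coeffs[k] * points[k] of vectors in R^3."""
    return [sum(c * p[i] for c, p in zip(coeffs, points)) for i in range(3)]

def gaussian_and_mean_curvature(X, u, v, eps=1e-4):
    """Approximate (K, H) at X(u, v) for the normal N = X_u x X_v / |X_u x X_v|."""
    # First and second partial derivatives of X by central differences.
    Xu = combine([0.5 / eps, -0.5 / eps], [X(u + eps, v), X(u - eps, v)])
    Xv = combine([0.5 / eps, -0.5 / eps], [X(u, v + eps), X(u, v - eps)])
    second = [1 / eps**2, -2 / eps**2, 1 / eps**2]
    Xuu = combine(second, [X(u + eps, v), X(u, v), X(u - eps, v)])
    Xvv = combine(second, [X(u, v + eps), X(u, v), X(u, v - eps)])
    Xuv = combine([c / (4 * eps**2) for c in (1, -1, -1, 1)],
                  [X(u + eps, v + eps), X(u + eps, v - eps),
                   X(u - eps, v + eps), X(u - eps, v - eps)])
    n = cross(Xu, Xv)
    length = math.sqrt(dot(n, n))
    N = [c / length for c in n]
    # Induced metric g_ij = <d_i X, d_j X>.
    g11, g12, g22 = dot(Xu, Xu), dot(Xu, Xv), dot(Xv, Xv)
    # Scalar second fundamental form h_ij = <d_i d_j X, N>.
    h11, h12, h22 = dot(Xuu, N), dot(Xuv, N), dot(Xvv, N)
    det_g = g11 * g22 - g12**2
    K = (h11 * h22 - h12**2) / det_g                            # det s = det h / det g
    H = (g22 * h11 - 2 * g12 * h12 + g11 * h22) / (2 * det_g)   # (1/2) tr s
    return K, H

def sphere(R):
    """Spherical-coordinate parametrization of the round sphere of radius R."""
    def X(theta, phi):
        return [R * math.sin(theta) * math.cos(phi),
                R * math.sin(theta) * math.sin(phi),
                R * math.cos(theta)]
    return X

K, H = gaussian_and_mean_curvature(sphere(2.0), 1.0, 0.5)
print(K, H)  # close to 1/R^2 = 0.25 and -1/R = -0.5
```

For this parametrization the cross product points outward, so the mean curvature comes out negative, consistent with the sign ambiguity $H = \pm 1/R$ noted for the sphere. Reversing the order of the parameters flips the sign of $H$ but not of $K$.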

A hypersurface $M$ with mean curvature identically equal to zero is called
minimal. The reason for this terminology is that, by an argument analogous
to those of Chapter 6, one can show that $H = 0$ is the variational
equation for the surface area functional $A(M) = \int_M dV$ (where $dV$ is the
Riemannian volume element for the induced metric $g$). Thus hypersurfaces
with mean curvature zero are precisely the critical points for the functional
$A$. In particular, any hypersurface that minimizes surface area among those
with a fixed boundary has zero mean curvature, and a small enough piece
of every minimal hypersurface is area minimizing. We do not pursue the
subject any further in this book, but you can find a good introductory
account in [Law80].

Clearly the mean curvature of any hypersurface changes sign if we change
the sign of the normal vector field. If $n$ is odd, the Gaussian curvature also
changes sign, but if $n$ is even (in particular for surfaces in $\mathbf{R}^3$), the Gaussian
curvature is independent of the choice of $N$. In any case, both the Gaussian
and mean curvatures are defined in terms of a particular embedding of $M$
into $\mathbf{R}^{n+1}$, and there is little reason to suspect that they have much to do
with the intrinsic Riemannian geometry of $M$ with its induced metric $g$.
The amazing discovery made by Gauss was that the Gaussian curvature
of a surface in $\mathbf{R}^3$ is actually an intrinsic invariant of the Riemannian
manifold $(M, g)$.

Theorem 8.6. (Gauss’s Theorema Egregium) Let $M \subset \mathbf{R}^3$ be a
2-dimensional submanifold and $g$ the induced metric on $M$. For any $p \in M$
and any basis $(X, Y)$ for $T_pM$, the Gaussian curvature of $M$ at $p$ is given
by

\[ K = \frac{Rm(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2}. \tag{8.5} \]

Therefore the Gaussian curvature is an isometry invariant of $(M, g)$.


Proof. We begin with the special case in which $(X, Y) = (E_1, E_2)$ is an
orthonormal basis for $T_pM$. In this case the denominator in (8.5) is equal
to 1. If we write $h_{ij} = h(E_i, E_j)$, then in this basis $K = \det s = \det(h_{ij})$,
and the Gauss equation (8.4) reads

\[ Rm(E_1, E_2, E_2, E_1) = h_{11}h_{22} - h_{12}h_{21} = \det(h_{ij}) = K. \]

This is equivalent to (8.5).

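Now let $X, Y$ be any basis for $T_pM$. The Gram-Schmidt algorithm yields
an orthonormal basis as follows:

\begin{align*}
E_1 &= \frac{X}{|X|}; \\
E_2 &= \frac{Y - \left\langle Y, \frac{X}{|X|}\right\rangle \frac{X}{|X|}}
            {\left| Y - \left\langle Y, \frac{X}{|X|}\right\rangle \frac{X}{|X|} \right|}
     = \frac{Y - \frac{\langle Y, X\rangle}{|X|^2} X}
            {\left| Y - \frac{\langle Y, X\rangle}{|X|^2} X \right|}.
\end{align*}

Then by the preceding computation, the Gaussian curvature at $p$ is

\begin{align*}
K &= Rm(E_1, E_2, E_2, E_1) \\
&= \frac{Rm\left(X,\; Y - \frac{\langle Y,X\rangle}{|X|^2}X,\; Y - \frac{\langle Y,X\rangle}{|X|^2}X,\; X\right)}
        {|X|^2 \left| Y - \frac{\langle Y,X\rangle}{|X|^2}X \right|^2} \\
&= \frac{Rm(X,Y,Y,X)}
        {|X|^2 \left( |Y|^2 - 2\frac{\langle Y,X\rangle^2}{|X|^2} + \frac{\langle Y,X\rangle^2}{|X|^2} \right)} \\
&= \frac{Rm(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2}.
\end{align*}

(In the third line, we used the fact that $Rm(X, X, \cdot, \cdot) = Rm(\cdot, \cdot, X, X) = 0$
by the symmetries of the curvature tensor.) This proves the theorem. □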
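
A standard pair of surfaces makes the content of the theorem concrete; we add it here as an illustration. The map $(u, v) \mapsto (R\cos(u/R), R\sin(u/R), v)$ carries the flat metric $du^2 + dv^2$ of the plane to the induced metric of the cylinder $x^2 + y^2 = R^2$, so the two surfaces are locally isometric. A direct computation with the outward normal (an arbitrary choice made for this example) gives the principal curvatures of the cylinder as $-1/R$ and $0$, and therefore:

```latex
\begin{align*}
\text{plane:}    &\quad \kappa_1 = \kappa_2 = 0,                  & K &= 0, & H &= 0, \\
\text{cylinder:} &\quad \kappa_1 = -\tfrac{1}{R},\ \kappa_2 = 0,  & K &= 0, & H &= -\tfrac{1}{2R}.
\end{align*}
```

The Gaussian curvatures agree, as the theorem requires of locally isometric surfaces, while the mean curvatures do not. Thus the mean curvature is not an isometry invariant.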

Motivated by the Theorema Egregium, we define the Gaussian curvature
$K$ of an abstract Riemannian 2-manifold $(M, g)$, not necessarily embedded
in $\mathbf{R}^3$, by formula (8.5) in terms of any local frame $(X, Y)$. In the special
case in which $M$ is a Riemannian submanifold of $\mathbf{R}^3$, the Theorema
Egregium shows that this agrees with the extrinsic definition of $K$ as the
determinant of the scalar second fundamental form.

Lemma 8.7. The Gaussian curvature of a Riemannian 2-manifold is related
to the curvature tensor, Ricci tensor, and scalar curvature by the
formulas

\begin{gather*}
Rm(X,Y,Z,W) = K\bigl(\langle X,W\rangle\langle Y,Z\rangle - \langle X,Z\rangle\langle Y,W\rangle\bigr); \\
Rc(X,Y) = K\langle X,Y\rangle; \tag{8.6} \\
S = 2K.
\end{gather*}

Thus $K$ is independent of choice of frame, and completely determines the
curvature tensor.

Proof. Since both sides of the first equation are tensors, we can compute
them in terms of any basis. Let $(E_1, E_2)$ be any orthonormal basis for $T_pM$,
and consider the components $R_{ijkl} = Rm(E_i, E_j, E_k, E_l)$ of the curvature
tensor. In terms of this basis, (8.5) gives $K = R_{1221}$. By antisymmetry,
$R_{ijkl}$ vanishes whenever $i = j$ or $k = l$, so the only nonzero components of
$Rm$ are

\[ R_{1221} = R_{2112} = -R_{1212} = -R_{2121} = K. \]

Comparing $Rm(X,Y,Z,W)$ with $K(\langle X,W\rangle\langle Y,Z\rangle - \langle X,Z\rangle\langle Y,W\rangle)$ when
each of $X, Y, Z, W$ is either $E_1$ or $E_2$ proves the first equation of (8.6).
The components of the Ricci tensor in this basis are

\[ R_{ij} = R_{1ij1} + R_{2ij2}, \]

from which it follows easily that

\[ R_{12} = R_{21} = 0; \qquad R_{11} = R_{22} = K, \]

which is equivalent to the second equation. Finally, the scalar curvature is

\[ S = \operatorname{tr}_g Rc = R_{11} + R_{22} = 2K. \]

Because the scalar curvature is independent of choice of frame, so is $K$. □

Although the Ricci tensor always satisfies $Rc = Kg$ on a 2-manifold, this
does not imply that $K$ is constant in two dimensions, as you can see from
the proof of Proposition 7.8. Thus the notion of an Einstein metric is not
useful for 2-manifolds.

Exercise 8.6. Show that the hyperbolic plane $\mathbf{H}^2_R$ of radius $R$ has constant
Gaussian curvature $K = -1/R^2$. [Hint: Show that it suffices to compute
$K$ at one point; the coordinate computations are easiest at the origin
in the disk model.]


Geometric Interpretation of Curvature in Higher 
Dimensions 

Sectional Curvatures 

Now we can give a quantitative geometric interpretation to the curvature
tensor in any dimension. Let $M$ be a Riemannian $n$-manifold and $p \in M$. If
$\Pi$ is any 2-dimensional subspace of $T_pM$, and $V \subset T_pM$ is any neighborhood
of zero on which $\exp_p$ is a diffeomorphism, then $S_\Pi := \exp_p(\Pi \cap V)$ is a
2-dimensional submanifold of $M$ containing $p$ (Figure 8.4), called the plane
section determined by $\Pi$. Note that $S_\Pi$ is just the set swept out by geodesics
whose initial tangent vectors lie in $\Pi$.

We define the sectional curvature of $M$ associated with $\Pi$, denoted $K(\Pi)$,
to be the Gaussian curvature of the surface $S_\Pi$ at $p$ with the induced metric.
If $(X, Y)$ is any basis for $\Pi$, we also use the notation $K(X, Y)$ for $K(\Pi)$.

Proposition 8.8. If $(X, Y)$ is any basis for a 2-plane $\Pi \subset T_pM$, then

\[ K(X,Y) = \frac{Rm(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2}. \tag{8.7} \]


Proof. For this proof, we denote the induced metric on $S_\Pi$ by $\tilde g$, and continue
to denote the metric on $M$ by $g$. As in the first part of this chapter,
we use tildes to denote geometric quantities associated with $\tilde g$, but note
that now the roles of $g$ and $\tilde g$ are reversed.

We claim first that the second fundamental form of $S_\Pi$ vanishes at $p$. To
see why, let $V \in \Pi \subset T_pM$, and let $\gamma = \gamma_V$ be the $M$-geodesic with initial
velocity $V$, which lies in $S_\Pi$ by definition. By the Gauss formula for vector
fields along curves,

\[ 0 = D_t\dot\gamma = \tilde D_t\dot\gamma + II(\dot\gamma, \dot\gamma). \]

Since the two terms in this sum are orthogonal, each must vanish identically.
Evaluating at $t = 0$ gives $II(V, V) = 0$. Since $V$ was an arbitrary element
of $\Pi = T_pS_\Pi$ and $II$ is symmetric, this shows that $II = 0$ at $p$. (We cannot in
general expect $II$ to vanish at other points of $S_\Pi$; it is only at $p$ that all
geodesics starting tangent to $S_\Pi$ remain in $S_\Pi$.)

Now the Gauss equation tells us that the curvature tensors of $S_\Pi$ and $M$
are related at $p$ by

\[ \widetilde{Rm}(X,Y,Z,W) = Rm(X,Y,Z,W) \]

whenever $X, Y, Z, W \in \Pi$. In particular, the Gaussian curvature of $S_\Pi$ at $p$
is

\[ K(\Pi) = \frac{\widetilde{Rm}(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2} = \frac{Rm(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2}. \]

This is what was to be proved. □

Thus one important class of quantitative information provided by the 
curvature tensor is the sectional curvatures of all plane sections. It turns 
out, in fact, that this is the only information contained in the curvature 
tensor: as the following lemma shows, the sectional curvatures completely 
determine the curvature tensor. 

Lemma 8.9. Suppose $\mathcal{R}_1$ and $\mathcal{R}_2$ are covariant 4-tensors on a vector space
$V$ with an inner product, and both have the symmetries of the curvature
tensor (as described in Proposition 7.4). If for every pair of independent
vectors $X, Y \in V$,

\[ \frac{\mathcal{R}_1(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2} = \frac{\mathcal{R}_2(X,Y,Y,X)}{|X|^2|Y|^2 - \langle X,Y\rangle^2}, \]

then $\mathcal{R}_1 = \mathcal{R}_2$.

Proof. Setting $\mathcal{R} = \mathcal{R}_1 - \mathcal{R}_2$, it suffices to show $\mathcal{R} = 0$ under the assumption
that $\mathcal{R}(X, Y, Y, X) = 0$ for all $X, Y$.

For any vectors $X, Y, Z$, since $\mathcal{R}$ also has the symmetries of the curvature
tensor,

\begin{align*}
0 &= \mathcal{R}(X + Y, Z, Z, X + Y) \\
&= \mathcal{R}(X,Z,Z,X) + \mathcal{R}(X,Z,Z,Y) + \mathcal{R}(Y,Z,Z,X) + \mathcal{R}(Y,Z,Z,Y) \\
&= 2\mathcal{R}(X,Z,Z,Y).
\end{align*}

From this it follows that

\begin{align*}
0 &= \mathcal{R}(X, Z + W, Z + W, Y) \\
&= \mathcal{R}(X,Z,Z,Y) + \mathcal{R}(X,Z,W,Y) + \mathcal{R}(X,W,Z,Y) + \mathcal{R}(X,W,W,Y) \\
&= \mathcal{R}(X,Z,W,Y) + \mathcal{R}(X,W,Z,Y).
\end{align*}

Therefore $\mathcal{R}$ is antisymmetric in any adjacent pair of arguments. Now the
algebraic Bianchi identity yields

\begin{align*}
0 &= \mathcal{R}(X,Y,Z,W) + \mathcal{R}(Y,Z,X,W) + \mathcal{R}(Z,X,Y,W) \\
&= \mathcal{R}(X,Y,Z,W) - \mathcal{R}(Y,X,Z,W) - \mathcal{R}(X,Z,Y,W) \\
&= 3\mathcal{R}(X,Y,Z,W).
\end{align*}

□ 

We can also give a geometric interpretation for the Ricci and scalar
curvatures. Given any unit vector $V \in T_pM$, choose an orthonormal basis
$\{E_i\}$ for $T_pM$ such that $E_1 = V$. Then $Rc(V, V)$ is given by

\[ Rc(V,V) = R_{11} = R_{k11}{}^k = \sum_{k=1}^n Rm(E_k, E_1, E_1, E_k) = \sum_{k=2}^n K(E_1, E_k). \]

Therefore the Ricci tensor has the following interpretation: For any unit
vector $V \in T_pM$, $Rc(V, V)$ is the sum of the sectional curvatures of planes
spanned by $V$ and other elements of an orthonormal basis. Since $Rc$ is
symmetric and bilinear, it is completely determined by its values of the form
$Rc(V, V)$ for unit vectors $V$.

Similarly, the scalar curvature is

\[ S = R_j{}^j = \sum_{j=1}^n Rc(E_j, E_j) = \sum_{j,k=1}^n Rm(E_k, E_j, E_j, E_k) = \sum_{j \neq k} K(E_j, E_k). \]

Therefore the scalar curvature is the sum of all sectional curvatures of
planes spanned by pairs of orthonormal basis elements.

If the opposite sign convention is chosen for the curvature tensor, then
the right-hand side of formula (8.7) has to be adjusted accordingly, with
$Rm(X, Y, X, Y)$ taking the place of $Rm(X, Y, Y, X)$. This is so that whatever
sign convention is chosen for the curvature tensor, the notion of positive
or negative sectional curvature has the same meaning for everyone.


Sectional Curvatures of the Model Spaces 

We can now compute the sectional curvatures of our three families of
homogeneous model spaces. Note first that each model space has an isometry
group that acts transitively on orthonormal frames, and so acts transitively
on 2-planes in the tangent bundle. Therefore each has constant sectional
curvature, which means that the sectional curvatures are the same for all
planes at all points.

First we consider the simplest case: Euclidean space. Since the curvature
tensor of $\mathbf{R}^n$ is identically zero, clearly all sectional curvatures are zero.
This is obvious geometrically, since each plane section is actually a plane,
which has zero Gaussian curvature.

Next consider the sphere $S^n_R$. We need only compute the sectional curvature
for the plane $\Pi$ spanned by $(\partial_1, \partial_2)$ at the north pole. The geodesics
with initial velocity in $\Pi$ are great circles in the $(x^1, x^2, x^{n+1})$ subspace.
Therefore $S_\Pi$ is isometric to the round 2-sphere of radius $R$ embedded in
$\mathbf{R}^3$. As we showed earlier in this chapter, $S^2_R$ has Gaussian curvature $1/R^2$.
Therefore $S^n_R$ has constant sectional curvature equal to $1/R^2$.

Finally we come to the hyperbolic spaces. It suffices to consider the
point $N = (0, \dots, 0, R)$ in the hyperboloid model, and the plane $\Pi \subset T_N\mathbf{H}^n_R$
spanned by $\partial/\partial\xi^1$, $\partial/\partial\xi^2$. The geodesics with initial velocities in $\Pi$ are great
hyperbolas lying in the $(\xi^1, \xi^2, \tau)$ subspace; they sweep out a 2-dimensional
hyperboloid that is easily seen to be isometric to $\mathbf{H}^2_R$. By Exercise 8.6,
therefore, $K(\Pi) = -1/R^2$, so $\mathbf{H}^n_R$ has constant sectional curvature $-1/R^2$.
(See also Problem 8-9 for another approach.)


Exercise 8.7. Show that real projective space $\mathbf{RP}^n$ has a metric of constant
positive sectional curvature.


Since the sectional curvatures determine the curvature tensor, one would 
expect to have an explicit formula for Rm when the sectional curvature is 
constant. Such a formula is provided in the following lemma. 

Lemma 8.10. Suppose $(M, g)$ is any Riemannian $n$-manifold with constant
sectional curvature $C$. The curvature endomorphism, curvature tensor,
Ricci tensor, and scalar curvature of $g$ are given by the formulas

\begin{align*}
R(X,Y)Z &= C\bigl(\langle Y,Z\rangle X - \langle X,Z\rangle Y\bigr); \\
Rm(X,Y,Z,W) &= C\bigl(\langle X,W\rangle\langle Y,Z\rangle - \langle X,Z\rangle\langle Y,W\rangle\bigr); \\
Rc &= (n-1)Cg; \\
S &= n(n-1)C.
\end{align*}

In terms of any basis,

\begin{align*}
R_{ijkl} &= C(g_{il}g_{jk} - g_{ik}g_{jl}); \\
R_{ij} &= (n-1)Cg_{ij}.
\end{align*}

Exercise 8.8. Prove Lemma 8.10. 
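
Before proving the lemma, it can be reassuring to check its formulas numerically. The sketch below is a sanity check rather than a proof, and all function names in it are hypothetical. It works in an orthonormal basis (so $g_{ij} = \delta_{ij}$, an assumption of the example), builds $R_{ijkl} = C(g_{il}g_{jk} - g_{ik}g_{jl})$, contracts it with the convention $R_{ij} = \sum_k R_{kijk}$ used in this chapter, and evaluates the sectional curvature from formula (8.7).

```python
def constant_curvature_tensor(n, C):
    """R_ijkl = C (g_il g_jk - g_ik g_jl) in an orthonormal basis (g_ij = delta_ij)."""
    def delta(a, b):
        return 1.0 if a == b else 0.0
    return [[[[C * (delta(i, l) * delta(j, k) - delta(i, k) * delta(j, l))
               for l in range(n)] for k in range(n)]
             for j in range(n)] for i in range(n)]

def ricci(Rm):
    """R_ij = sum_k R_kijk: contraction on the first and last indices."""
    n = len(Rm)
    return [[sum(Rm[k][i][j][k] for k in range(n)) for j in range(n)]
            for i in range(n)]

def scalar(Rc):
    """S = sum_i R_ii in an orthonormal basis."""
    return sum(Rc[i][i] for i in range(len(Rc)))

def sectional(Rm, X, Y):
    """K(X, Y) = Rm(X, Y, Y, X) / (|X|^2 |Y|^2 - <X, Y>^2), as in (8.7)."""
    n = len(Rm)
    def dot(a, b):
        return sum(p * q for p, q in zip(a, b))
    numerator = sum(Rm[i][j][k][l] * X[i] * Y[j] * Y[k] * X[l]
                    for i in range(n) for j in range(n)
                    for k in range(n) for l in range(n))
    return numerator / (dot(X, X) * dot(Y, Y) - dot(X, Y) ** 2)

n, C = 4, -0.5
Rm = constant_curvature_tensor(n, C)
Rc = ricci(Rm)
print(Rc[0][0], scalar(Rc))                          # (n-1)C and n(n-1)C
print(sectional(Rm, [1, 2, 0, -1], [0, 1, 3, 1]))    # C, up to rounding
```

The vectors passed to `sectional` are deliberately neither unit nor orthogonal, which exercises the normalizing denominator in (8.7).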
Problems


8-1. Let $M \subset \mathbf{R}^3$ be a surface of revolution as described in Exercise 3.3
and Problem 5-2.

(a) If the generating curve $\gamma$ is unit speed, show that the Gaussian
curvature of $M$ is $-\ddot a(t)/a(t)$.

(b) Show that there is a surface of revolution in $\mathbf{R}^3$ that has constant
Gaussian curvature equal to 1 but does not have constant mean
curvature.


8-2. Suppose $\Omega$ is an open set in $\mathbf{R}^n$ and $f\colon \Omega \to \mathbf{R}$ is a smooth function.
Let $M = \{(x, f(x)) : x \in \Omega\} \subset \mathbf{R}^{n+1}$ be the graph of $f$. Observe
that the map $\varphi\colon \Omega \to M$ given by $\varphi(u) = (u, f(u))$ gives a global
parametrization of $M$; the corresponding coordinates $(u^1, \dots, u^n)$ on
$M$ are called graph coordinates.

(a) Let $N$ be the upward-pointing unit normal vector field along
$M$. Compute the components of the shape operator in graph
coordinates, in terms of $f$ and its partial derivatives.

(b) Let $M \subset \mathbf{R}^{n+1}$ be the paraboloid defined as the graph of $f(u) =
|u|^2$. Compute the principal curvatures of $M$.

8-3. Let $\Omega \subset \mathbf{R}^{n+1}$ be an open set, $F\colon \Omega \to \mathbf{R}$ a smooth submersion, and
$M = F^{-1}(0)$. ($F$ is called a defining function for $M$.) Show that the
scalar second fundamental form of $M$ with respect to the unit normal
vector field $N = \operatorname{grad} F/|\operatorname{grad} F|$ is given by

\[ h(V, W) = -\frac{\partial_i\partial_j F\, V^i W^j}{|\operatorname{grad} F|}, \]

where $V = V^i\partial_i$ in Euclidean coordinates on $\mathbf{R}^{n+1}$. Derive formulas
for the Gaussian and mean curvatures of $M$ in the case $n = 2$.


8-4. Let $M \subset \mathbf{R}^3$ be the catenoid, which is the surface of revolution
obtained by revolving the curve $x = \cosh z$ around the $z$-axis. Show
that $M$ is a minimal surface.

8-5. Suppose $M \subset \tilde M$ is a compact, embedded, Riemannian submanifold.
For any $\varepsilon > 0$, let $N_\varepsilon$ denote the subset $\{V : |V| < \varepsilon\}$ of the normal
bundle $NM$, and $M_\varepsilon$ the set of points in $\tilde M$ whose Riemannian
distance from $M$ is less than $\varepsilon$.

(a) Prove the tubular neighborhood theorem: For $\varepsilon$ sufficiently small,
the restriction to $N_\varepsilon$ of the exponential map of $\tilde M$ is a
diffeomorphism from $N_\varepsilon$ to $M_\varepsilon$. Any open set that is the image
of such a diffeomorphism is called a tubular neighborhood of $M$.

(b) If $r(x)$ denotes the distance from $x \in \tilde M$ to $M$, show that $r^2$
is a smooth function on any tubular neighborhood $M_\varepsilon$. Give an
example in which $r^2$ is not smooth on all of $\tilde M$.

8-6. Let $M \subset \mathbf{R}^{n+1}$ be a hypersurface with the induced metric and $N$ a
smooth unit normal vector field along $M$. At each point $p \in M$, $N_p \in
T_p\mathbf{R}^{n+1}$ can be thought of as a unit vector in $\mathbf{R}^{n+1}$ and therefore
as a point in $S^n$. Thus each choice of normal vector field defines a
smooth map $N\colon M \to S^n$, called the Gauss map of $M$. Show that
$N^*dV_{\mathring g} = K\, dV_g$, where $dV_{\mathring g}$ is the volume element of $S^n$ with the
round metric, and $K$ is the Gaussian curvature of $M$.

8-7. Suppose $g = g_1 \oplus g_2$ is a product metric on $M_1 \times M_2$ as in (3.3).

(a) Show that for each point $p_i \in M_i$, the submanifolds $M_1 \times \{p_2\}$
and $\{p_1\} \times M_2$ are totally geodesic.

(b) If $\Pi \subset T(M_1 \times M_2)$ is a 2-plane spanned by $X_1 \in TM_1$ and
$X_2 \in TM_2$, show that $K(\Pi) = 0$.

(c) Show that the product metric on $S^2 \times S^2$ has nonnegative
sectional curvature.

(d) Show that there is an embedding of $T^2$ in $S^2 \times S^2$ such that the
induced metric is flat.

8-8. Consider the basis



for the Lie algebra $\mathfrak{su}(2)$. For each positive real number $a$, define a
left-invariant metric $g_a$ on the group $SU(2)$ by declaring $aX, Y, Z$ to be
an orthonormal frame. Compute the sectional curvatures with respect
to $g_a$ of the planes spanned by $(X, Y)$, $(Y, Z)$, and $(Z, X)$. [Remark:
$SU(2)$ is diffeomorphic to $S^3$ by the map that sends $(\alpha, \beta) \in S^3 \subset \mathbf{C}^2$

to $\in SU(2)$. These metrics are called the Berger metrics

on $S^3$.]

8-9. This problem outlines another proof that the sectional curvature of
$\mathbf{H}^n_R$ is $-1/R^2$.

(a) If $\tilde M$ is a pseudo-Riemannian manifold, a submanifold $\iota\colon M \hookrightarrow
\tilde M$ is called spacelike if $\iota^*\tilde g$ is positive definite on $M$. Show that
the Gauss formula and the Gauss equation hold for spacelike
submanifolds.

(b) Prove that the sectional curvature of $\mathbf{H}^n_R$ is $-1/R^2$ by applying
the Gauss equation to the hyperboloid model.

8-10. Suppose $M$ is a connected $n$-dimensional Riemannian manifold, and
a Lie group $G$ acts effectively on $M$ by isometries. (A group action is
said to be effective if no element of $G$ other than the identity acts as
the identity on $M$.) Show that $\dim G \le n(n+1)/2$, and that equality
is possible only if $M$ has constant sectional curvature.

8-11. Let $p\colon (\tilde M, \tilde g) \to (M, g)$ be a Riemannian submersion (Problem 3-8).
Using the notation and results of Problem 5-9, show that the sectional
curvatures of $g$ are related to those of $\tilde g$ by

\[ K(X,Y) = \tilde K(\tilde X, \tilde Y) + \frac{3}{4}\bigl|[\tilde X, \tilde Y]^V\bigr|^2 \]

for any pair $X, Y$ of orthonormal vector fields on $M$.

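8-12. Let $p\colon S^{2n+1} \to \mathbf{CP}^n$ be the Riemannian submersion described in
Problem 3-9. We identify $\mathbf{C}^{n+1}$ with $\mathbf{R}^{n+1} \times \mathbf{R}^{n+1}$ by means of
coordinates $(x^1, \dots, x^{n+1}, y^1, \dots, y^{n+1})$ defined by $z^j = x^j + iy^j$.

(a) Show that the vector field

\[ T = x^j \frac{\partial}{\partial y^j} - y^j \frac{\partial}{\partial x^j} \]

on $\mathbf{C}^{n+1}$ is tangent to $S^{2n+1}$ and spans the vertical space $V_z$ at
each point $z \in S^{2n+1}$.

(b) If $W, Z$ are horizontal vector fields on $S^{2n+1}$, show that

\[ [W, Z]^V = -d\omega(W, Z)T = 2\langle W, JZ\rangle T, \]

where $\omega$ is the 1-form on $\mathbf{C}^{n+1}$ given by

\[ \omega = T^\flat = x^j\, dy^j - y^j\, dx^j, \]

and $J\colon T\mathbf{C}^{n+1} \to T\mathbf{C}^{n+1}$ is the orthogonal, real-linear map

\[ J\left( X^j \frac{\partial}{\partial x^j} + Y^j \frac{\partial}{\partial y^j} \right) = X^j \frac{\partial}{\partial y^j} - Y^j \frac{\partial}{\partial x^j}. \]

(This is just multiplication by $i = \sqrt{-1}$ in complex coordinates.)

(c) Let $W, Z$ be orthonormal vectors in $T\mathbf{CP}^n$. Show that the sectional
curvature $K(W, Z)$ is

\[ K(W, Z) = 1 + 3\langle \tilde W, J\tilde Z\rangle^2. \]

(See Problem 8-11.)

(d) If $n \ge 2$, show that at each point of $\mathbf{CP}^n$, the sectional curvatures
take on all values between 1 and 4, inclusive. Compute the
Gaussian curvature of $\mathbf{CP}^1$.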

8-13. Suppose $(M, g)$ is a 3-dimensional Riemannian manifold that is
homogeneous and isotropic. Show that $g$ has constant sectional curvature.
Show that the analogous result in dimension 4 is not true. [Hint: See
Problem 8-12.]

8-14. Let G be a Lie group with a bi-invariant metric g (see Problem 7-5). 

(a) Show that the sectional curvatures of g are all nonnegative. 

(b) If $H \subset G$ is a Lie subgroup, show that $H$ is totally geodesic.

(c) If H is connected, show that it is flat in the induced metric if 
and only if it is Abelian. 
The Gauss-Bonnet Theorem


We are finally in a position to prove our first major local-global theorem in
Riemannian geometry: the Gauss-Bonnet theorem. This is a local-global
theorem par excellence, because it asserts the equality of two very differently
defined quantities on a compact, orientable Riemannian 2-manifold $M$:
the integral of the Gaussian curvature, which is determined by the local
geometry of $M$; and $2\pi$ times the Euler characteristic of $M$, which is a
global topological invariant. Although it applies only in two dimensions, it
has provided a model and an inspiration for innumerable local-global results
in higher-dimensional geometry, some of which we will prove in Chapter
11.

This chapter begins with some not-so-elementary notions from plane
geometry, leading up to a proof of Hopf’s rotation angle theorem, which
expresses the intuitive idea that the tangent vector of a simple closed curve,
or more generally of a “curved polygon,” makes a net rotation through
an angle of exactly $2\pi$ as one traverses the curve counterclockwise. Then
we investigate curved polygons on Riemannian 2-manifolds, leading to a
far-reaching generalization of the rotation angle theorem called the
Gauss-Bonnet formula, which expresses the relationship among the exterior angles,
the geodesic curvature of the boundary, and the Gaussian curvature in the
interior of a curved polygon. Finally, we use the Gauss-Bonnet formula to
prove the global statement of the Gauss-Bonnet theorem.

Some Plane Geometry

Look back for a moment at the three local-global theorems about plane
geometry stated in Chapter 1: the angle-sum theorem, the circumference
theorem, and the total curvature theorem. When looked at correctly, these
three theorems all turn out to be manifestations of the same phenomenon:
as one traverses a simple closed plane curve in the counterclockwise
direction, the tangent vector makes a net rotation through an angle of exactly
$2\pi$. Our task in the first part of this chapter is to make these notions precise.

Throughout this section, $\gamma\colon [a, b] \to \mathbf{R}^2$ is a unit speed admissible curve
in the plane. We say $\gamma$ is simple if it is injective on $[a, b)$, and closed if
$\gamma(b) = \gamma(a)$.

If $\gamma$ is smooth, we can define the tangent angle $\theta(t)$ as the unique continuous
map $\theta\colon [a, b] \to \mathbf{R}$ such that $\dot\gamma(t) = (\cos\theta(t), \sin\theta(t))$ for all $t \in [a, b]$,
and such that $\theta(a) \in (-\pi, \pi]$. That such a continuous choice of angle exists
follows from the theory of covering spaces: since $\gamma$ is unit speed, and the
tangent space to $\mathbf{R}^2$ is naturally identified with $\mathbf{R}^2$ itself, we can think of
$\dot\gamma$ as a map from $[a, b]$ to $S^1$. By the path-lifting property of covering maps
[Mas67, Lemma V.3.1], this map lifts to the universal covering $\pi\colon \mathbf{R} \to S^1$
given by $\pi(\theta) = (\cos\theta, \sin\theta)$. Our tangent angle function $\theta$ is the unique
continuous lift with the additional property that $\theta(a) \in (-\pi, \pi]$. Because
$\dot\gamma\colon [a, b] \to S^1$ is a smooth map, and the covering map $\pi$ is a local
diffeomorphism, it follows that $\theta$ is actually smooth.

If $\gamma$ is a unit speed regular closed curve such that $\dot\gamma(a) = \dot\gamma(b)$ (Figure
9.1), we define the rotation angle of $\gamma$ to be $\operatorname{Rot}(\gamma) := \theta(b) - \theta(a)$, where $\theta$
is the tangent angle function defined above. Clearly $\operatorname{Rot}(\gamma)$ is an integral
multiple of $2\pi$, since $\theta(a)$ and $\theta(b)$ both represent the angle from the $x$-axis
to $\dot\gamma(a)$. (Note that our choice of normalization $\theta(a) \in (-\pi, \pi]$ is immaterial
here; we just chose it so that $\theta$ would be uniquely defined.)
FIGURE 9.2. An exterior angle.



FIGURE 9.3. A cusp. 


We would also like to extend the definition of the rotation angle to certain
piecewise smooth closed curves. For this purpose, we have to take into
account the “jumps” in the tangent angle at corners. To do so, recall that
$\gamma$ has left-hand and right-hand tangent vectors at $t = a_i$, denoted $\dot\gamma(a_i^-)$
and $\dot\gamma(a_i^+)$ respectively. Define the exterior angle at $a_i$ to be the oriented
angle $\varepsilon_i$ from $\dot\gamma(a_i^-)$ to $\dot\gamma(a_i^+)$, chosen to be in the interval $[-\pi, \pi]$, with a
positive sign if $(\dot\gamma(a_i^-), \dot\gamma(a_i^+))$ is an oriented basis for $\mathbf{R}^2$, and a negative
sign otherwise (Figure 9.2). (If $\dot\gamma(a_i^-) = -\dot\gamma(a_i^+)$, $\gamma$ has a “cusp” and there is
no unambiguous way to choose between $\pi$ and $-\pi$ (Figure 9.3); for now we
leave it unspecified.) If $\gamma$ is closed, define the exterior angle at $\gamma(a) = \gamma(b)$
to be the angle from $\dot\gamma(b)$ to $\dot\gamma(a)$, chosen in the interval $[-\pi, \pi]$.

The curves we wish to consider are of the following type: A curved polygon
in the plane is a simple, closed, piecewise smooth, unit speed curve segment,
none of whose exterior angles is equal to $\pm\pi$, that is the boundary of a
bounded open set $\Omega \subset \mathbf{R}^2$. If $a = a_0 < \dots < a_k = b$ is a subdivision of
$[a, b]$ such that $\gamma$ is smooth on $[a_{i-1}, a_i]$, the points $\gamma(a_i)$ are called the
vertices of $\gamma$, and the curve segments $\gamma|_{[a_{i-1}, a_i]}$ are called its edges or sides.

If $\gamma$ is parametrized so that at points where $\gamma$ is smooth, $\dot\gamma$ is consistent
with the induced orientation on $\gamma = \partial\Omega$ in the sense of Stokes’s theorem,
we say $\gamma$ is positively oriented (Figure 9.4). Intuitively, this just means that
$\gamma$ is parametrized in the counterclockwise direction, or that $\Omega$ is always to
the left of $\gamma$.

Suppose $\gamma$ is a curved polygon. We define the tangent angle $\theta\colon [a, b] \to \mathbb{R}$ 
as follows (Figures 9.5 and 9.6): Beginning with $\theta(a) \in (-\pi, \pi]$, we define 
$\theta(t)$ for $t \in [a, a_1)$ to be the unique continuous choice of angle from the 
$x$-axis to $\dot\gamma(t)$ as above. At the first vertex $\gamma(a_1)$, let 

$$\theta(a_1) = \lim_{t \nearrow a_1} \theta(t) + \varepsilon_1.$$

FIGURE 9.4. A positively oriented curved polygon. 

FIGURE 9.5. Defining the tangent angle at a vertex. 

FIGURE 9.6. The tangent angle function for the curve in Fig. 9.5. 


(See Figure 9.5.) Then extend $\theta$ continuously on $[a_1, a_2)$, and continue by 
induction, until finally 

$$\theta(b) = \lim_{t \nearrow b} \theta(t) + \varepsilon_k,$$

where $\varepsilon_k$ is the exterior angle at $\gamma(b)$. We define the rotation angle of $\gamma$ 
to be $\mathrm{Rot}(\gamma) := \theta(b) - \theta(a)$. $\mathrm{Rot}(\gamma)$ is again an integral multiple of $2\pi$, 
because the definition ensures that $\theta(b)$ and $\theta(a)$ are both representations 
of the angle from the $x$-axis to $\dot\gamma(a)$. 

The following theorem is due to Heinz Hopf [Hop35] (for a more accessible 
version of the proof, see [Hop83, formula (7.1)]). In the literature, 
it is sometimes referred to by the German name given to it by Hopf, the 
Umlaufsatz. 

FIGURE 9.7. The curve $\gamma$ after changing the parameter interval and 
translating $\gamma(a)$ to the origin. 


Theorem 9.1. (Rotation Angle Theorem) If $\gamma$ is a positively oriented 
curved polygon in the plane, the rotation angle of $\gamma$ is exactly $2\pi$. 


Proof. Suppose first that all the exterior angles are zero. This means, in 
particular, that $\dot\gamma$ is continuous and $\dot\gamma(a) = \dot\gamma(b)$. Since $\gamma$ is closed, we can 
extend it to a continuous map from $\mathbb{R}$ to $\mathbb{R}^2$ by requiring it to be periodic of 
period $b - a$. Our hypothesis that $\dot\gamma(a) = \dot\gamma(b)$ guarantees that the extended 
map still has continuous first derivatives. 

$\mathrm{Rot}(\gamma)$ is clearly unchanged if we consider $\gamma$ as being defined on any 
interval $[\tilde a, \tilde b]$ of length $b - a$ (this just changes the point at which we 
start). Let’s choose our parameter interval $[\tilde a, \tilde b]$ such that the $y$-coordinate 
of $\gamma$ achieves its minimum at $t = \tilde a$; for convenience, we relabel the new 
interval as $[a, b]$. Moreover, by a translation we may as well assume that 
$\gamma(a)$ is the origin. Then the image of $\gamma$ remains in the upper half-plane, 
and $\dot\gamma(a) = \dot\gamma(b) = \partial/\partial x$ (Figure 9.7). 

Since $\dot\gamma$ is continuous, so is the tangent angle function $\theta\colon [a, b] \to \mathbb{R}$. 
We will extend this function to a continuous secant angle function $\varphi(t_1, t_2)$ 
defined on the triangle $T := \{(t_1, t_2) : a \le t_1 \le t_2 \le b\}$ (Figure 9.8), 
representing the angle between the $x$-axis and the vector from $\gamma(t_1)$ to 
$\gamma(t_2)$. To be precise, first define a map $V\colon T \to S^1$ by 

$$V(t_1, t_2) = \begin{cases} \dfrac{\gamma(t_2) - \gamma(t_1)}{|\gamma(t_2) - \gamma(t_1)|}, & t_1 < t_2 \text{ and } (t_1, t_2) \ne (a, b); \\[1ex] \dot\gamma(t_1), & t_1 = t_2; \\[1ex] -\dot\gamma(a), & (t_1, t_2) = (a, b). \end{cases}$$

FIGURE 9.8. The domain of the secant angle function. 

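$V$ is continuous along the line $t_1 = t_2$, because 

$$\lim_{(t_1, t_2) \to (t, t)} \frac{\gamma(t_2) - \gamma(t_1)}{|\gamma(t_2) - \gamma(t_1)|} = \lim_{(t_1, t_2) \to (t, t)} \frac{\gamma(t_2) - \gamma(t_1)}{t_2 - t_1} \bigg/ \left| \frac{\gamma(t_2) - \gamma(t_1)}{t_2 - t_1} \right| = \dot\gamma(t)/|\dot\gamma(t)| = V(t, t).$$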


A similar argument shows that $V$ is continuous at $(a, b)$, using the fact that 
$\dot\gamma(a) = \dot\gamma(b)$. Since $T$ is simply connected, the theory of covering spaces (cf. 
[Mas67, Theorem V.5.1]) guarantees that $V\colon T \to S^1$ has a continuous lift 
$\varphi\colon T \to \mathbb{R}$, which is unique if we require $\varphi(a, a) = 0$ (Figure 9.9). This is 
our secant angle function. 

We can write $\mathrm{Rot}(\gamma) = \theta(b) - \theta(a) = \varphi(b, b) - \varphi(a, a) = \varphi(b, b)$. Observe 
that, along the side of $T$ where $t_1 = a$ and $t_2 \in [a, b]$, the vector 
$V$ has its tail at the origin and its head in the upper half-plane. Since we 
stipulate that $\varphi(a, a) = 0$, we must have $\varphi(a, t_2) \in [0, \pi]$ on this segment. 
By continuity, therefore, $\varphi(a, b) = \pi$ (since $\varphi(a, b)$ represents the angle of 
$-\dot\gamma(a) = -\partial/\partial x$). Similarly, on the side where $t_2 = b$, $V$ has its head at the 
origin and its tail in the upper half-plane, so $\varphi(t_1, b) \in [\pi, 2\pi]$. Therefore, 
since $\varphi(b, b)$ represents the angle of $\dot\gamma(b) = \partial/\partial x$, we must have $\varphi(b, b) = 2\pi$. 
This completes the proof for the case where $\dot\gamma$ is continuous. 

Now suppose $\gamma$ has vertices. It suffices to show there is a curve with a 
continuous tangent vector that has the same rotation angle as $\gamma$. We will 
construct such a curve by “rounding the corners” of $\gamma$. It will simplify the 
proof somewhat if we choose the parameter interval $[a, b]$ so that $\gamma(a) = 
\gamma(b)$ is not a vertex. 

Let $\gamma(a_i)$ be any vertex, and $\varepsilon_i$ its exterior angle. Let $\alpha$ be a small positive 
number depending on $\varepsilon_i$; we will describe how to choose it later. Recall that 
our definition of $\theta(t)$ guarantees that $\theta$ is continuous from the right, and 
$\lim_{t \nearrow a_i} \theta(t) = \theta(a_i) - \varepsilon_i$. Therefore, we can choose $\delta$ small enough that 
$|\theta(t) - (\theta(a_i) - \varepsilon_i)| < \alpha$ when $t \in (a_i - \delta, a_i)$, and $|\theta(t) - \theta(a_i)| < \alpha$ when 
$t \in (a_i, a_i + \delta)$. 

The image under $\gamma$ of $[a, b] - (a_i - \delta, a_i + \delta)$ is a compact set disjoint 
from $\gamma(a_i)$, so we can choose $r$ small enough that $\gamma$ does not enter $\overline{B}_r(\gamma(a_i))$ 
except when $t \in (a_i - \delta, a_i + \delta)$. Let $t_1 \in (a_i - \delta, a_i + \delta)$ denote the time when 
$\gamma$ enters $\overline{B}_r(\gamma(a_i))$ and $t_2$ the time when it leaves (Figure 9.10). By our 
choice of $\delta$, the total change in $\theta(t)$ is not more than $\alpha$ when $t \in [t_1, a_i)$, and 
again not more than $\alpha$ when $t \in (a_i, t_2]$. Therefore, the 
total change $\Delta\theta$ in $\theta(t)$ during the time interval $[t_1, t_2]$ is between $\varepsilon_i - 2\alpha$ 
and $\varepsilon_i + 2\alpha$. If we choose $\alpha < \tfrac{1}{2}(\pi - |\varepsilon_i|)$, it satisfies $-\pi < \Delta\theta < \pi$. 

Now we simply replace the portion of $\gamma$ from time $t_1$ to time $t_2$ with a 
smooth curve segment $\sigma$ that is tangent to $\gamma$ at $\gamma(t_1)$ and $\gamma(t_2)$, and whose 
tangent angle increases or decreases monotonically from $\theta(t_1)$ to $\theta(t_2)$; an 
arc of a hyperbola will do (Figure 9.11). Since the change in tangent angle 
of $\sigma$ is between $-\pi$ and $\pi$ and represents the angle between $\dot\gamma(t_1)$ and $\dot\gamma(t_2)$, 
it must be exactly $\Delta\theta$. (The length of $\sigma$ may not be the same as that of 
the portion of $\gamma$ being replaced, but we can simply reparametrize the new 
curve by arc length.) Repeating this process for each vertex, we obtain a 
new curve with a continuous tangent vector field whose rotation angle is 
the same as that of $\gamma$, thus proving the theorem. □ 
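
The simplest place to see the rotation angle theorem at work is a polygon with straight edges: the tangent angle is constant along each edge, so the rotation angle reduces to the sum of the exterior angles. The following Python sketch is an illustration only, not part of the proof; the function names and the sample polygon are our own choices. It computes each exterior angle as the oriented angle from the incoming edge direction to the outgoing one.

```python
import math

def exterior_angles(vertices):
    """Exterior angle at each vertex of a closed polygon with straight edges.

    vertices: list of (x, y) pairs in traversal order; the polygon closes
    from the last vertex back to the first.
    """
    n = len(vertices)
    angles = []
    for i in range(n):
        px, py = vertices[i - 1]          # previous vertex
        qx, qy = vertices[i]              # current vertex
        rx, ry = vertices[(i + 1) % n]    # next vertex
        ux, uy = qx - px, qy - py         # incoming edge direction
        vx, vy = rx - qx, ry - qy         # outgoing edge direction
        cross = ux * vy - uy * vx         # positive when (u, v) is an oriented basis
        dot = ux * vx + uy * vy
        angles.append(math.atan2(cross, dot))  # oriented angle in (-pi, pi]
    return angles

def rotation_angle(vertices):
    # On straight edges the tangent angle is constant, so only the jumps contribute.
    return sum(exterior_angles(vertices))

# A nonconvex "L" shape, listed counterclockwise (positively oriented).
L_SHAPE = [(0, 0), (2, 0), (2, 1), (1, 1), (1, 2), (0, 2)]
```

For the counterclockwise L-shaped polygon the sum is $2\pi$ even though one exterior angle is negative; traversing the same vertices in the opposite order changes the sign of the result, which is why the positive orientation hypothesis is needed.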









FIGURE 9.12. A curved polygon on a surface. 


From the rotation angle theorem, it is not hard to deduce the three local-global 
theorems mentioned at the beginning of the chapter as corollaries. 
(The angle-sum theorem is trivial; for the total curvature theorem, the trick 
is to show that $\dot\theta(t)$ is equal to the signed curvature of $\gamma$; the circumference 
theorem follows from the total curvature theorem as mentioned in Chapter 
1.) However, instead of proving them directly, we will prove a general formula, 
called the Gauss-Bonnet formula, from which these results and more 
follow easily. You will easily see how the statement and proof of Theorem 
9.3 below can be simplified in case the metric is Euclidean. 


The Gauss-Bonnet Formula 

We now direct our attention to the case of an oriented Riemannian 
2-manifold $(M, g)$. In this setting, a unit speed curve $\gamma\colon [a, b] \to M$ is called 
a curved polygon if $\gamma$ is the boundary of an open set $\Omega$ with compact closure, 
and there is a coordinate chart containing $\gamma$ and $\Omega$ under whose image $\gamma$ 
is a curved polygon in the plane (Figure 9.12). Using the coordinates to 
transfer $\gamma$, $\Omega$, and $g$ to the plane, we may as well assume that $g$ is a metric 
on some open subset $\mathcal{U} \subset \mathbb{R}^2$, and $\gamma$ is a curved polygon in $\mathcal{U}$. 

For a curved polygon $\gamma$ in $M$, our previous definitions go through almost 
unchanged. We say $\gamma$ is positively oriented if it is parametrized in the 
direction of its Stokes orientation as the boundary of $\Omega$. We define the 
exterior angle $\varepsilon_i$ at a vertex $\gamma(a_i)$ as the oriented angle from $\dot\gamma(a_i^-)$ to $\dot\gamma(a_i^+)$ 
with respect to the $g$-inner product and the given orientation of $M$, so it 
satisfies $\cos \varepsilon_i = \langle \dot\gamma(a_i^-), \dot\gamma(a_i^+) \rangle$. Having chosen coordinates, we define the 
tangent angle $\theta\colon [a, b] \to \mathbb{R}$ on segments where $\dot\gamma$ is continuous as the unique 
continuous choice of angle from $\partial/\partial x$ to $\dot\gamma$, measured with respect to $g$, with 
jumps at vertices as before. The rotation angle is $\mathrm{Rot}(\gamma) = \theta(b) - \theta(a)$. 
Because of the role played by $\partial/\partial x$ in the definition, it is not clear yet that 
the rotation angle has any coordinate-invariant meaning; however, we do 
have the following easy consequence of the rotation angle theorem. 

Lemma 9.2. If $\gamma$ is a positively oriented curved polygon in $M$, the rotation 
angle of $\gamma$ is $2\pi$. 

Proof. If we use the given coordinate chart to consider $\gamma$ as a curved polygon 
in the plane, we can compute its tangent angle function either with 
respect to $g$ or with respect to the Euclidean metric $\bar g$. In either case, $\mathrm{Rot}(\gamma)$ 
is an integral multiple of $2\pi$ because $\theta(a)$ and $\theta(b)$ both represent the same 
angle. Now for $0 \le s \le 1$, let $g_s = sg + (1 - s)\bar g$. By the same reasoning, 
the rotation angle $\mathrm{Rot}_{g_s}(\gamma)$ with respect to $g_s$ is also a multiple of $2\pi$. The 
function $f(s) = (1/2\pi)\,\mathrm{Rot}_{g_s}(\gamma)$ is therefore integer-valued, and is easily 
seen to be continuous in $s$, so it must be constant. Since $f(0) = 1$ by the 
rotation angle theorem, it follows that $f(1) = 1$. □ 

There is a unique unit normal vector field $N$ along the smooth portions of 
$\gamma$ such that $(\dot\gamma(t), N(t))$ is an oriented orthonormal basis for $T_{\gamma(t)}M$ for 
each $t$. If $\gamma$ is positively oriented as the boundary of $\Omega$, this is equivalent 
to $N$ being the inward-pointing normal to $\partial\Omega$ (Figure 9.13). We define the 
signed curvature $\kappa_N(t)$ at smooth points of $\gamma$ by 

$$\kappa_N(t) = \langle D_t \dot\gamma(t), N(t) \rangle.$$

By differentiating $|\dot\gamma(t)|^2 = 1$, we see that $D_t \dot\gamma(t)$ is orthogonal to $\dot\gamma(t)$, and 
therefore we can write $D_t \dot\gamma(t) = \kappa_N(t) N(t)$, and the (unsigned) curvature 
of $\gamma$ is $\kappa(t) = |\kappa_N(t)|$. The sign of $\kappa_N$ is positive if $\gamma$ is curving toward $\Omega$ 
and negative if it is curving away. 

Theorem 9.3. (The Gauss-Bonnet Formula) Suppose $\gamma$ is a curved 
polygon on an oriented Riemannian 2-manifold $(M, g)$, and $\gamma$ is positively 
oriented as the boundary of an open set $\Omega$ with compact closure. Then 

$$\int_\Omega K\,dA + \int_\gamma \kappa_N\,ds + \sum_i \varepsilon_i = 2\pi, \tag{9.1}$$

where $K$ is the Gaussian curvature of $g$ and $dA$ is its Riemannian volume 
element. 

Proof. Let $a = a_0 < \cdots < a_k = b$ be a subdivision of $[a, b]$ into segments on 
which $\gamma$ is smooth. Using the rotation angle theorem and the fundamental 
theorem of calculus, we can write 

$$2\pi = \sum_{i=1}^{k} \varepsilon_i + \sum_{i=1}^{k} \int_{a_{i-1}}^{a_i} \dot\theta(t)\,dt. \tag{9.2}$$

To prove (9.1), we need to derive a relationship among $\dot\theta$, $\kappa_N$, and $K$. 

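We begin by constructing a specially adapted orthonormal frame. Let 
$(x, y)$ be oriented coordinates on an open set $\mathcal{U}$ containing $\gamma$ and $\Omega$. The 
Gram-Schmidt algorithm applied to the frame $(\partial/\partial x, \partial/\partial y)$ yields an oriented 
orthonormal frame $(E_1, E_2)$ such that $E_1$ is a positive multiple of 
$\partial/\partial x$. Then, because $\theta(t)$ represents the $g$-angle between $E_1$ and $\dot\gamma(t)$, it is 
easy to see that the following hold at smooth points of $\gamma$: 

$$\dot\gamma(t) = \cos\theta(t)\,E_1 + \sin\theta(t)\,E_2;$$
$$N(t) = -\sin\theta(t)\,E_1 + \cos\theta(t)\,E_2.$$

Differentiating $\dot\gamma$ (and omitting the $t$ dependence from the notation for 
simplicity), we get 

$$\begin{aligned} D_t \dot\gamma &= -\dot\theta(\sin\theta)E_1 + (\cos\theta)\nabla_{\dot\gamma}E_1 + \dot\theta(\cos\theta)E_2 + (\sin\theta)\nabla_{\dot\gamma}E_2 \\ &= \dot\theta N + (\cos\theta)\nabla_{\dot\gamma}E_1 + (\sin\theta)\nabla_{\dot\gamma}E_2. \end{aligned} \tag{9.3}$$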

Next we analyze the covariant derivatives of $E_1$ and $E_2$. Because $(E_1, E_2)$ 
is an orthonormal frame, for any vector $X$ we have 

$$\begin{aligned} 0 &= \nabla_X |E_1|^2 = 2\langle \nabla_X E_1, E_1 \rangle; \\ 0 &= \nabla_X |E_2|^2 = 2\langle \nabla_X E_2, E_2 \rangle; \\ 0 &= \nabla_X \langle E_1, E_2 \rangle = \langle \nabla_X E_1, E_2 \rangle + \langle E_1, \nabla_X E_2 \rangle. \end{aligned}$$

The first two equations show that $\nabla_X E_1$ is a multiple of $E_2$ and $\nabla_X E_2$ is 
a multiple of $E_1$. Define a 1-form $\omega$ by 

$$\omega(X) := \langle E_1, \nabla_X E_2 \rangle = -\langle \nabla_X E_1, E_2 \rangle.$$

It follows that the covariant derivatives of the basis elements are given by 

$$\nabla_X E_1 = -\omega(X)E_2; \qquad \nabla_X E_2 = \omega(X)E_1. \tag{9.4}$$

Thus the 1-form $\omega$ completely determines the connection in $\mathcal{U}$. (In fact, 
when the connection is expressed in terms of the local frame $\{E_i\}$ as in 
Problem 4-5, this computation shows that the connection 1-forms are just 
$\omega_2{}^1 = -\omega_1{}^2 = \omega$, $\omega_1{}^1 = \omega_2{}^2 = 0$.) 

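Using (9.3) and (9.4), we can compute 

$$\begin{aligned} \kappa_N &= \langle D_t \dot\gamma, N \rangle \\ &= \langle \dot\theta N, N \rangle + \cos\theta \langle \nabla_{\dot\gamma}E_1, N \rangle + \sin\theta \langle \nabla_{\dot\gamma}E_2, N \rangle \\ &= \dot\theta - \cos\theta \langle \omega(\dot\gamma)E_2, N \rangle + \sin\theta \langle \omega(\dot\gamma)E_1, N \rangle \\ &= \dot\theta - \cos^2\theta\,\omega(\dot\gamma) - \sin^2\theta\,\omega(\dot\gamma) \\ &= \dot\theta - \omega(\dot\gamma). \end{aligned}$$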

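Therefore, (9.2) becomes 

$$\begin{aligned} 2\pi &= \sum_{i=1}^{k} \varepsilon_i + \sum_{i=1}^{k} \int_{a_{i-1}}^{a_i} \kappa_N(t)\,dt + \sum_{i=1}^{k} \int_{a_{i-1}}^{a_i} \omega(\dot\gamma(t))\,dt \\ &= \sum_{i=1}^{k} \varepsilon_i + \int_\gamma \kappa_N\,ds + \int_\gamma \omega. \end{aligned}$$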

The theorem will therefore be proved if we can show that 

$$\int_\gamma \omega = \int_\Omega K\,dA. \tag{9.5}$$

If $\gamma$ were a smooth closed curve, Stokes’s theorem would imply that the 
left-hand side of (9.5) is equal to $\int_\Omega d\omega$. In fact, this is true anyway: by a 
construction similar to that used in the proof of the rotation angle theorem, 
we can approximate $\gamma$ uniformly by a sequence of smooth curves $\gamma_j$ whose 
lengths approach that of $\gamma$, and that are boundaries of domains $\Omega_j$ such that 
the area between $\Omega_j$ and $\Omega$ approaches zero. Applying Stokes’s theorem on 
$\Omega_j$ and taking the limit as $j \to \infty$, we conclude that 

$$2\pi = \sum_{i=1}^{k} \varepsilon_i + \int_\gamma \kappa_N\,ds + \int_\Omega d\omega.$$

The last step of the proof is to show that $d\omega = K\,dA$. This follows from 
the general formula relating the curvature tensor and the connection 
1-forms given in Problem 7-2; but in the case of two dimensions we can give an 
easy direct proof. Since $(E_1, E_2)$ is an oriented orthonormal frame, it follows 
by definition of the Riemannian volume element that $dA(E_1, E_2) = 1$. Using 
(9.4), we compute 

$$\begin{aligned} K\,dA(E_1, E_2) &= K = Rm(E_1, E_2, E_2, E_1) \\ &= \langle \nabla_{E_1}\nabla_{E_2}E_2 - \nabla_{E_2}\nabla_{E_1}E_2 - \nabla_{[E_1, E_2]}E_2,\ E_1 \rangle \\ &= \langle \nabla_{E_1}(\omega(E_2)E_1) - \nabla_{E_2}(\omega(E_1)E_1) - \omega[E_1, E_2]E_1,\ E_1 \rangle \\ &= \langle E_1(\omega(E_2))E_1 + \omega(E_2)\nabla_{E_1}E_1 - E_2(\omega(E_1))E_1 \\ &\qquad - \omega(E_1)\nabla_{E_2}E_1 - \omega[E_1, E_2]E_1,\ E_1 \rangle \\ &= E_1(\omega(E_2)) - E_2(\omega(E_1)) - \omega[E_1, E_2] \\ &= d\omega(E_1, E_2). \end{aligned}$$

This completes the proof. □ 
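
As a sanity check on formula (9.1), consider a spherical cap on the round sphere of radius $R$, bounded by the latitude circle at colatitude $\varphi_0$ (with $0 < \varphi_0 < \pi$). The sketch below relies on three standard facts that are assumptions here rather than results of this section: $K = 1/R^2$, the cap has area $2\pi R^2(1 - \cos\varphi_0)$, and the latitude circle has length $2\pi R \sin\varphi_0$ and signed curvature $\cot\varphi_0 / R$ with respect to the normal pointing into the cap. The boundary is smooth, so the exterior angles contribute nothing. The function name is ours.

```python
import math

def gauss_bonnet_cap(phi0, radius=1.0):
    """Left-hand side of (9.1) for the cap {colatitude < phi0} on a round sphere."""
    K = 1.0 / radius**2                                   # Gaussian curvature
    area = 2.0 * math.pi * radius**2 * (1.0 - math.cos(phi0))
    kappa_N = math.cos(phi0) / (radius * math.sin(phi0))  # signed curvature of the latitude circle
    length = 2.0 * math.pi * radius * math.sin(phi0)
    exterior_angle_sum = 0.0                              # smooth boundary, no corners
    return K * area + kappa_N * length + exterior_angle_sum
```

The two terms trade off against each other: a small cap has little area but a tightly curving boundary, while for $\varphi_0 > \pi/2$ the boundary curves away from the region and $\kappa_N$ is negative. In every case the total is $2\pi$, independent of the radius.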

The three local-global theorems of plane geometry stated in Chapter 1 
follow from the Gauss-Bonnet formula as easy corollaries. Their proofs are 
left to the reader. 

Corollary 9.4. (Angle-Sum Theorem) The sum of the interior angles 
of a Euclidean triangle is $\pi$. 


Corollary 9.5. (Circumference Theorem) The circumference of a Euclidean 
circle of radius $R$ is $2\pi R$. 


Corollary 9.6. (Total Curvature Theorem) If $\gamma\colon [a, b] \to \mathbb{R}^2$ is a unit 
speed simple closed curve such that $\dot\gamma(a) = \dot\gamma(b)$, and $N$ is the inward-pointing 
normal, then 

$$\int_a^b \kappa_N(t)\,dt = 2\pi.$$

Exercise 9.1. Prove the three corollaries above. 


The Gauss-Bonnet Theorem 

It is now a relatively easy matter to “globalize” the Gauss-Bonnet formula 
to obtain the Gauss-Bonnet theorem. The link between the local and global 
results is provided by triangulations, so we begin by discussing this construction 
borrowed from algebraic topology. Most of the topological ideas 
touched upon in this section can be found treated in detail in either [Sie92] 
or [Mas67]. 

If $M$ is a smooth, compact 2-manifold, a smooth triangulation of $M$ is 
a finite collection of curved triangles (i.e., three-sided curved polygons), 
such that the union of the closed regions $\overline{\Omega}_i$ bounded by the triangles is 
$M$, and the intersection of any pair (if not empty) is either a single vertex 
of each or a single edge of each (Figures 9.14 and 9.15). Every smooth, 
compact surface possesses a smooth triangulation. In fact, it was proved 
by Tibor Radó [Rad25] in 1925 that every compact topological 2-manifold 
possesses a triangulation (without the assumption of smoothness of the 
edges, of course). There is a proof for the smooth case that is not terribly 
hard, outlined in Problem 9-5. 

FIGURE 9.14. Illegal intersections of triangles in a triangulation. 

FIGURE 9.15. Valid intersections. 

If $M$ is a triangulated 2-manifold, the Euler characteristic of $M$ with 
respect to the given triangulation is defined to be 

$$\chi(M) := N_v - N_e + N_f,$$

where $N_v$ is the number of vertices in the triangulation, $N_e$ is the number 
of edges, and $N_f$ is the number of faces (the $\Omega_i$’s). It is an important result 
of algebraic topology that the Euler characteristic is in fact a topological 
invariant, and is independent of the choice of triangulation (see [Sie92, 
Theorem 13.3.1]). 

Theorem 9.7. (The Gauss-Bonnet Theorem) If $M$ is a triangulated, 
compact, oriented, Riemannian 2-manifold, then 

$$\int_M K\,dA = 2\pi\chi(M).$$


Proof. Let $\{\Omega_i : i = 1, \ldots, N_f\}$ denote the faces of the triangulation, and 
for each $i$ let $\{\gamma_{ij} : j = 1, 2, 3\}$ be the edges of $\Omega_i$ and $\{\theta_{ij} : j = 1, 2, 3\}$ 
its interior angles. Since each exterior angle is $\pi$ minus the corresponding 
interior angle, applying the Gauss-Bonnet formula to each triangle and 
summing over $i$ gives 

$$\sum_{i=1}^{N_f} \int_{\Omega_i} K\,dA + \sum_{i=1}^{N_f} \sum_{j=1}^{3} \int_{\gamma_{ij}} \kappa_N\,ds + \sum_{i=1}^{N_f} \sum_{j=1}^{3} (\pi - \theta_{ij}) = \sum_{i=1}^{N_f} 2\pi. \tag{9.6}$$

FIGURE 9.16. Interior angles at a vertex add up to $2\pi$. 


Note that each edge integral appears exactly twice in the above sum, 
with opposite orientations, so the integrals of $\kappa_N$ all cancel out. Thus (9.6) 
becomes 

$$\int_M K\,dA + 3\pi N_f - \sum_{i=1}^{N_f} \sum_{j=1}^{3} \theta_{ij} = 2\pi N_f. \tag{9.7}$$

Note also that each interior angle $\theta_{ij}$ appears exactly once. At each vertex, 
the angles that touch that vertex must add up to $2\pi$ (Figure 9.16); thus 
the angle sum can be rearranged to give exactly $2\pi N_v$. Equation (9.7) thus 
can be written 

$$\int_M K\,dA = 2\pi N_v - \pi N_f. \tag{9.8}$$

Finally, since each edge appears in exactly two triangles, and each triangle 
has exactly three edges, the total number of edges counted with multiplicity 
is $2N_e = 3N_f$, where we count each edge once for each triangle 
in which it appears. This means that $N_f = 2N_e - 2N_f$, so (9.8) finally 
becomes 

$$\int_M K\,dA = 2\pi N_v - 2\pi N_e + 2\pi N_f = 2\pi\chi(M).$$

□ 
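
The combinatorial side of the theorem is easy to experiment with. The following Python sketch (function and variable names are ours) counts vertices, edges, and faces of a triangulation given as a list of vertex triples, and returns the total curvature that Theorem 9.7 then forces on any Riemannian metric. The two sample triangulations are assumptions of the example: the boundary of a tetrahedron as a triangulated sphere, and a commonly cited 7-vertex triangulation of the torus with vertices labeled by residues mod 7.

```python
import math
from itertools import combinations

def euler_characteristic(triangles):
    """Return (N_v, N_e, N_f, chi) for a triangulation given as vertex triples."""
    vertices = {v for tri in triangles for v in tri}
    edges = {frozenset(pair) for tri in triangles for pair in combinations(tri, 2)}
    n_v, n_e, n_f = len(vertices), len(edges), len(triangles)
    return n_v, n_e, n_f, n_v - n_e + n_f

def total_curvature(triangles):
    # Theorem 9.7: the integral of K dA equals 2*pi*chi(M), whatever the metric.
    return 2.0 * math.pi * euler_characteristic(triangles)[3]

# Boundary of a tetrahedron: a triangulated sphere.
SPHERE = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]

# A 7-vertex triangulation of the torus (vertices are residues mod 7).
TORUS = [(i, (i + 1) % 7, (i + 3) % 7) for i in range(7)] + \
        [(i, (i + 2) % 7, (i + 3) % 7) for i in range(7)]
```

In both examples the counts satisfy $2N_e = 3N_f$, the relation used in the last step of the proof: 6 edges and 4 faces for the tetrahedron, 21 edges and 14 faces for the torus.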

The significance of this theorem cannot be overstated. Together with 
the classification theorem for compact surfaces, it gives us a very complete 
picture of the possible Gaussian curvatures for metrics on compact surfaces. 
The classification theorem (see, for example, [Sie92, Theorem 13.2.5] or 
[Mas67, Theorem 1.5.1]) says that every compact, orientable 2-manifold 
is homeomorphic to a sphere or the connected sum of $g$ tori, and every 
nonorientable one is homeomorphic to the connected sum of $g$ copies of the 
projective plane $\mathbb{P}^2$; the number $g$ is called the genus of the surface. (The 
sphere is said to have genus zero.) By constructing simple triangulations, 
it is easy to check that the Euler characteristic of an orientable surface of 
genus $g$ is $2 - 2g$, and that of a nonorientable one is $2 - g$. 

Corollary 9.8. Let $M$ be a compact Riemannian 2-manifold and $K$ its 
Gaussian curvature. 

(a) If $M$ is homeomorphic to the sphere or the projective plane, then 
$K > 0$ somewhere. 

(b) If $M$ is homeomorphic to the torus or the Klein bottle, then either 
$K \equiv 0$ or $K$ takes on both positive and negative values. 

(c) If $M$ is any other compact surface, then $K < 0$ somewhere. 

Proof. If $M$ is orientable, the result follows immediately from the Gauss-Bonnet 
theorem, because a function whose integral is positive, negative, 
or zero must satisfy the claimed sign condition. If $M$ is nonorientable, the 
result follows by applying the Gauss-Bonnet theorem to the orientable 
double cover $\pi\colon \widetilde{M} \to M$ with the lifted metric $\tilde g = \pi^* g$, using the fact 
that $\widetilde{M}$ is the sphere if $M = \mathbb{P}^2$, the torus if $M$ is the Klein bottle (which 
is homeomorphic to the connected sum of two copies of $\mathbb{P}^2$), and otherwise 
has $\chi(\widetilde{M}) < 0$. □ 

This corollary has a remarkable converse, proved in the mid-1970s by 
Jerry Kazdan and Frank Warner: If $K$ is any smooth function on a compact 
2-manifold $M$ satisfying the necessary sign condition of Corollary 9.8, 
then there exists a Riemannian metric on $M$ for which $K$ is the Gaussian 
curvature. The proof is a deep application of the theory of nonlinear partial 
differential equations. (See [Kaz85] for a nice expository account.) 

In Corollary 9.8 we assumed we knew the topology of $M$ and drew conclusions 
about the possible curvatures it could support. In the following 
corollary we reverse our point of view, and use assumptions about the curvature 
to draw conclusions about the manifold. 

Corollary 9.9. Let $M$ be a compact Riemannian 2-manifold and $K$ its 
Gaussian curvature. 

(a) If $K > 0$, then $M$ is homeomorphic to the sphere or projective plane, 
and $\pi_1(M)$ is finite. 

(b) If $K \le 0$, then $\pi_1(M)$ is infinite, and $M$ has genus at least 1. 

Exercise 9.2. Prove Corollary 9.9. 

Much of the effort in contemporary Riemannian geometry is aimed at 
generalizing the Gauss-Bonnet theorem and its topological consequences to 
higher dimensions. As we will see in the next chapter, most of the interesting 
results have required the development of different methods. However, 
there is one rather direct generalization of the Gauss-Bonnet theorem that 
deserves mention: the Chern-Gauss-Bonnet theorem. This was proved by 
Hopf in 1925 for an $n$-manifold embedded in $\mathbb{R}^{n+1}$ with the induced metric, 
and in 1944 by Chern for abstract Riemannian manifolds (see [Spi79, 
volume 5] for a complete discussion with references). The theorem asserts 
that on any oriented vector space there exists a basis-independent function 

$$\mathcal{P}\colon \{\text{4-tensors with the symmetries of } Rm\} \to \mathbb{R},$$

called the Pfaffian, such that for any oriented compact even-dimensional 
Riemannian $n$-manifold $M$, 

$$\int_M \mathcal{P}(Rm)\,dV = \tfrac{1}{2}\,\mathrm{Vol}(S^n)\,\chi(M).$$

(Here $\chi(M)$ is again the Euler characteristic of $M$, which can be defined 
analogously to that of a surface and is a topological invariant.) 

In a certain sense, this might be considered a very satisfactory generalization 
of Gauss-Bonnet. The only problem with this result is that the 
relationship between the Pfaffian and sectional curvatures is obscure in 
higher dimensions, so no one seems to have any idea how to interpret the 
theorem geometrically! For example, it is not even known whether the assumption 
that $M$ has strictly positive sectional curvatures implies that 
$\chi(M) > 0$. 

Problems 

9-1. Let $M \subset \mathbb{R}^3$ be a compact, orientable, embedded 2-manifold with 
the induced metric. 

(a) Show that $M$ cannot have $K \le 0$ everywhere. [Hint: Look at a 
point where the distance from the origin takes a maximum.] 

(b) Show that $M$ cannot have $K \ge 0$ everywhere unless $\chi(M) > 0$. 

9-2. Let $(M, g)$ be a Riemannian 2-manifold. A curved polygon on $M$ 
whose sides are geodesic segments is called a geodesic polygon. If $g$ 
has everywhere nonpositive Gaussian curvature, prove that there are 
no geodesic polygons with exactly 0, 1, or 2 vertices. Give examples 
of all three if the curvature hypothesis is not satisfied. 

9-3. A geodesic triangle on a Riemannian 2-manifold $(M, g)$ is a three-sided 
geodesic polygon (Problem 9-2). 

(a) If $M$ has constant Gaussian curvature $K$, show that the sum of 
the interior angles of a geodesic triangle $\gamma$ is equal to $\pi + KA$, 
where $A$ is the area of the region bounded by $\gamma$. 

(b) Suppose $M$ is either the 2-sphere of radius $R$ or the hyperbolic 
plane of radius $R$. Show that similar triangles are congruent. 
More precisely, if $\gamma_1$ and $\gamma_2$ are geodesic triangles with equal 
interior angles, then there exists an isometry of $M$ taking $\gamma_1$ to 
$\gamma_2$. 

9-4. An ideal triangle in the hyperbolic plane is a region whose boundary 
consists of three geodesics, any two of which meet at a common 
point on the boundary of the disk (in the Poincaré disk model). Show 
that all ideal triangles have the same finite area, and compute it. Be 
careful to justify any limits. 

9-5. This problem outlines a proof that every compact smooth 2-manifold 
has a smooth triangulation. 

(a) Show that it suffices to prove there exist finitely many convex 
geodesic polygons whose interiors cover $M$, and each of which lies 
in a uniformly normal convex geodesic ball. (A geodesic polygon 
is called convex if it together with its interior is a convex set in 
the sense of Problem 6-4.) 

(b) Using the result of Problem 6-4, show that there exist finitely 
many points $(v_1, \ldots, v_k)$ and $\varepsilon > 0$ such that the geodesic balls 
$B_{3\varepsilon}(v_i)$ are convex and uniformly normal, and the balls $B_\varepsilon(v_i)$ 
cover $M$. 

(c) For each $i$, show that there is a convex geodesic polygon in 
$B_{3\varepsilon}(v_i)$ whose interior contains $B_\varepsilon(v_i)$. [Hint: Let the vertices 
be sufficiently nearby points on the circle of radius $2\varepsilon$ around 
$v_i$.] 

(d) Prove the result. 

9-6. Prove the plane curve classification theorem (Theorem 1.5). [Hint: 
Any plane curve satisfies the ordinary differential equation $\ddot\gamma(t) = 
\kappa_N(t) N(t)$.] 





10 

Jacobi Fields 


Our goal for the remainder of this book is to generalize to higher dimensions 
some of the geometric and topological consequences of the Gauss-Bonnet 
theorem. We need to develop a new approach: instead of using Stokes’s 
theorem and differential forms to relate the curvature to global topology 
as in the proof of the Gauss-Bonnet theorem, we study how curvature 
affects the behavior of nearby geodesics. Roughly speaking, positive curvature 
causes nearby geodesics to converge (Figure 10.1), while negative 
curvature causes them to spread out (Figure 10.2). In order to draw topological 
consequences from this fact, we need a quantitative way to measure 
the effect of curvature on a one-parameter family of geodesics. 

We begin by deriving the Jacobi equation, which is an ordinary differential 
equation satisfied by the variation field of any one-parameter family of 
geodesics. A vector field satisfying this equation along a geodesic is called 
a Jacobi field. We then introduce the notion of conjugate points, which 
are pairs of points along a geodesic where some Jacobi field vanishes. 
Intuitively, if $p$ and $q$ are conjugate along a geodesic, one expects to find a 
one-parameter family of geodesics that start at $p$ and end (almost) at $q$. 

After defining conjugate points, we prove a simple but essential fact: the 
points conjugate to $p$ are exactly the points where $\exp_p$ fails to be a local 
diffeomorphism. We then derive an expression for the second derivative 
of the length functional with respect to proper variations of a geodesic, 
called the “second variation formula.” Using this formula, we prove another 
essential fact about conjugate points: No geodesic is minimizing past its 
first conjugate point. 

In the final chapter, we will derive topological consequences of these facts. 

FIGURE 10.1. Positive curvature causes geodesics to converge. 

FIGURE 10.2. Negative curvature causes geodesics to spread out. 

The Jacobi Equation 

In order to study the effect of curvature on nearby geodesics, we focus 
on variations through geodesics. Suppose therefore that $\gamma\colon [a, b] \to M$ is 
a geodesic segment, and $\Gamma\colon (-\varepsilon, \varepsilon) \times [a, b] \to M$ is a variation of $\gamma$ (as 
defined in Chapter 6). We say $\Gamma$ is a variation through geodesics if each of 
the main curves $\Gamma_s(t) = \Gamma(s, t)$ is also a geodesic segment. (In particular, 
this requires that $\Gamma$ be smooth.) Our first goal is to derive an equation that 
must be satisfied by the variation field of a variation through geodesics. 

Write $T(s, t) = \partial_t \Gamma(s, t)$ and $S(s, t) = \partial_s \Gamma(s, t)$ as in Chapter 6. The 
geodesic equation tells us that 

$$D_t T \equiv 0$$

for all $(s, t)$. We can take the covariant derivative of this equation with 
respect to $s$, yielding 

$$D_s D_t T \equiv 0.$$

To relate this to the variation field of $\gamma$, we need to commute the covariant 
differentiation operators $D_s$ and $D_t$. Because these are covariant derivatives 
acting on a vector field along a curve, we should expect the curvature to 
be involved. Indeed, we have the following lemma. 

Lemma 10.1. If $\Gamma$ is any smooth admissible family of curves, and $V$ is a 
smooth vector field along $\Gamma$, then 

$$D_s D_t V - D_t D_s V = R(S, T)V.$$

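Proof. This is a local issue, so we can compute in any local coordinates. 
Writing $V(s, t) = V^i(s, t)\partial_i$, we compute 

$$D_t V = \frac{\partial V^i}{\partial t}\partial_i + V^i D_t \partial_i.$$

Therefore, 

$$D_s D_t V = \frac{\partial^2 V^i}{\partial s\,\partial t}\partial_i + \frac{\partial V^i}{\partial t} D_s \partial_i + \frac{\partial V^i}{\partial s} D_t \partial_i + V^i D_s D_t \partial_i.$$

Interchanging $D_s$ and $D_t$ and subtracting, we see that all the terms except 
the last cancel: 

$$D_s D_t V - D_t D_s V = V^i (D_s D_t \partial_i - D_t D_s \partial_i). \tag{10.1}$$

Now we need to compute the commutator in parentheses. If we write the 
coordinate functions of $\Gamma$ as $x^j(s, t)$, then 

$$S = \frac{\partial x^k}{\partial s}\partial_k; \qquad T = \frac{\partial x^j}{\partial t}\partial_j.$$

Because $\partial_i$ is extendible, 

$$D_t \partial_i = \nabla_T \partial_i = \frac{\partial x^j}{\partial t}\nabla_{\partial_j}\partial_i,$$

and therefore, because $\nabla_{\partial_j}\partial_i$ is also extendible, 

$$\begin{aligned} D_s D_t \partial_i &= D_s\!\left( \frac{\partial x^j}{\partial t}\nabla_{\partial_j}\partial_i \right) \\ &= \frac{\partial^2 x^j}{\partial s\,\partial t}\nabla_{\partial_j}\partial_i + \frac{\partial x^j}{\partial t}\nabla_S(\nabla_{\partial_j}\partial_i) \\ &= \frac{\partial^2 x^j}{\partial s\,\partial t}\nabla_{\partial_j}\partial_i + \frac{\partial x^j}{\partial t}\frac{\partial x^k}{\partial s}\nabla_{\partial_k}\nabla_{\partial_j}\partial_i. \end{aligned}$$

Interchanging $s \leftrightarrow t$ and $j \leftrightarrow k$ and subtracting, we find that the first 
terms cancel out, and we get 

$$\begin{aligned} D_s D_t \partial_i - D_t D_s \partial_i &= \frac{\partial x^j}{\partial t}\frac{\partial x^k}{\partial s}\left( \nabla_{\partial_k}\nabla_{\partial_j}\partial_i - \nabla_{\partial_j}\nabla_{\partial_k}\partial_i \right) \\ &= \frac{\partial x^j}{\partial t}\frac{\partial x^k}{\partial s} R(\partial_k, \partial_j)\partial_i \\ &= R(S, T)\partial_i. \end{aligned}$$

Finally, inserting this into (10.1) yields the result. □ 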


Theorem 10.2. (The Jacobi Equation) Let $\gamma$ be a geodesic and $V$ a 
vector field along $\gamma$. If $V$ is the variation field of a variation through 
geodesics, then $V$ satisfies 

$$D_t^2 V + R(V, \dot\gamma)\dot\gamma = 0. \tag{10.2}$$

Proof. With $S$ and $T$ as before, the preceding lemma implies 

$$\begin{aligned} 0 &= D_s D_t T \\ &= D_t D_s T + R(S, T)T \\ &= D_t D_t S + R(S, T)T, \end{aligned}$$

where the last step follows from the symmetry lemma. Evaluating at $s = 0$, 
where $S(0, t) = V(t)$ and $T(0, t) = \dot\gamma(t)$, we get (10.2). □ 

Any vector field along a geodesic satisfying the Jacobi equation is called a 
Jacobi field. Because of the following lemma, which is a converse to Theorem 
10.2, each Jacobi field tells us how some family of geodesics behaves, at least 
“infinitesimally” along $\gamma$. 

Lemma 10.3. Every Jacobi field along a geodesic $\gamma$ is the variation field 
of some variation of $\gamma$ through geodesics. 

Exercise 10.1. Prove Lemma 10.3. [Hint: Let $\Gamma(s, t) = \exp_{\sigma(s)} tW(s)$ for 
a suitable curve $\sigma$ and vector field $W$ along $\sigma$.] 

Now we reverse our approach: let’s forget about variations for a while, 
and just study Jacobi fields in their own right. As the following lemma 
shows, the Jacobi equation can be written as a system of second-order 
linear ordinary differential equations, so it has a unique solution given initial 
values for V and DtV at one point. 

Proposition 10.4. (Existence and Uniqueness of Jacobi Fields) Let 
$\gamma\colon I \to M$ be a geodesic, $a \in I$, and $p = \gamma(a)$. For any pair of vectors 
$X, Y \in T_pM$, there is a unique Jacobi field $J$ along $\gamma$ satisfying the initial 
conditions 

$$J(a) = X; \qquad D_t J(a) = Y.$$

Proof. Choose an orthonormal basis $\{E_i\}$ for $T_pM$, and extend it to a 
parallel orthonormal frame along all of $\gamma$. Writing $J(t) = J^i(t)E_i$, we can 
express the Jacobi equation as 

$$\ddot J^i + R_{jkl}{}^i J^j \dot\gamma^k \dot\gamma^l = 0.$$

This is a linear system of second-order ODEs for the $n$ functions $J^i$. Making 
the usual substitution $V^i = \dot J^i$ converts it to an equivalent first-order linear 
system for the $2n$ unknowns $\{J^i, V^i\}$. Then Theorem 4.12 guarantees the 
existence and uniqueness of a solution on the whole interval $I$ with any 
initial conditions $J^i(a) = X^i$, $V^i(a) = Y^i$. □ 
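
The reduction to a first-order system in this proof is also how one would solve the Jacobi equation numerically. The sketch below makes a simplifying assumption that goes beyond the text: on a surface of constant Gaussian curvature $C$, along a unit speed geodesic, the coefficient of a normal Jacobi field with respect to a parallel unit normal satisfies the scalar equation $\ddot J + CJ = 0$. We integrate it as the first-order system $\dot J = V$, $\dot V = -CJ$ with a classical Runge-Kutta step; the function name and the step count are our own choices.

```python
import math

def jacobi_normal(curvature, t_end, j0=0.0, v0=1.0, steps=2000):
    """Integrate J'' + curvature * J = 0 as the system J' = V, V' = -curvature * J."""
    def rhs(j, v):
        return v, -curvature * j

    h = t_end / steps
    j, v = j0, v0
    for _ in range(steps):            # classical fourth-order Runge-Kutta
        k1j, k1v = rhs(j, v)
        k2j, k2v = rhs(j + 0.5 * h * k1j, v + 0.5 * h * k1v)
        k3j, k3v = rhs(j + 0.5 * h * k2j, v + 0.5 * h * k2v)
        k4j, k4v = rhs(j + h * k3j, v + h * k3v)
        j += h * (k1j + 2 * k2j + 2 * k3j + k4j) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
    return j, v
```

With initial data $J(0) = 0$, $\dot J(0) = 1$, the numerical solution tracks $\sin t$ when $C = 1$, $t$ when $C = 0$, and $\sinh t$ when $C = -1$. In the first case it returns to zero at $t = \pi$, in the second it grows linearly, and in the third it grows exponentially, in line with the convergence and spreading of geodesics described at the start of the chapter.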

Corollary 10.5. Along any geodesic $\gamma$, the set of Jacobi fields is a 
$2n$-dimensional linear subspace of $\mathcal{T}(\gamma)$. 

Proof. Let $p = \gamma(a)$ be any point on $\gamma$, and consider the map from the set 
of Jacobi fields along $\gamma$ to $T_pM \oplus T_pM$ by sending $J$ to $(J(a), D_t J(a))$. The 
preceding proposition says precisely that this map is bijective. □ 

There are always two trivial Jacobi fields along any geodesic, which 
we can write down immediately (see Figure 10.3). Because Dt'f = 0 and 
= 0 by antisymmetry of R, the vector field Jo(t) = 7 (t) satisfies 
the Jacobi equation with initial conditions 

Jo(0) = 7(0); A Jo(0) = 0. 
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FIGURE 10.3. Trivial Jacobi fields. 


Similarly, Ji(i) = tjit) is a Jacobi field with initial conditions 
Ji(0)=0; AJi(0)=7(0). 

It is easy to see that Jq is the variation field of the variation r(s,t) = 
7 (s + t), while Ji is the variation field of r(s, t) = Therefore, these 

two Jacobi fields just reflect the possible reparametrizations of 7 , and don’t 
tell us anything about the behavior of geodesics other than 7 itself. 

To distinguish these trivial cases from more informative ones, we make
the following definitions. A tangential vector field along a curve $\gamma$ is a vector
field $V$ such that $V(t)$ is a multiple of $\dot\gamma(t)$ for all $t$, and a normal vector
field is one such that $V(t) \perp \dot\gamma(t)$ for all $t$.

Lemma 10.6. Let $\gamma\colon I \to M$ be a geodesic, and $a \in I$.

(a) A Jacobi field $J$ along $\gamma$ is normal if and only if

$$J(a) \perp \dot\gamma(a) \quad\text{and}\quad D_tJ(a) \perp \dot\gamma(a). \tag{10.3}$$

(b) Any Jacobi field orthogonal to $\dot\gamma$ at two points is normal.

Proof. Using compatibility with the metric and the fact that $D_t\dot\gamma = 0$, we
compute

$$\frac{d^2}{dt^2}\langle J, \dot\gamma\rangle = \langle D_t^2 J, \dot\gamma\rangle = -Rm(J, \dot\gamma, \dot\gamma, \dot\gamma) = 0$$

by the symmetries of the curvature tensor. Thus, by elementary calculus,
$f(t) := \langle J(t), \dot\gamma(t)\rangle$ is a linear function of $t$. Note that $f(a) = \langle J(a), \dot\gamma(a)\rangle$
and $\dot f(a) = \langle D_tJ(a), \dot\gamma(a)\rangle$. Thus $J(a)$ and $D_tJ(a)$ are orthogonal to $\dot\gamma(a)$ if
and only if $f$ and its first derivative vanish at $a$, which happens if and only
if $f \equiv 0$. Similarly, if $J$ is orthogonal to $\dot\gamma$ at two points, then $f$ vanishes at
two points and is therefore identically zero. □

As a consequence of this lemma, it is easy to check that the space of normal
Jacobi fields is a $(2n - 2)$-dimensional subspace of $\mathcal{T}(\gamma)$, and the space
of tangential ones is a 2-dimensional subspace. Every Jacobi field can be
uniquely decomposed into the sum of a tangential Jacobi field plus a normal
Jacobi field, just by decomposing its initial value and initial derivative.

FIGURE 10.4. A Jacobi field in normal coordinates.


Computations of Jacobi Fields 

In Riemannian normal coordinates, half of the Jacobi fields are easy to 
write down explicitly. 

Lemma 10.7. Let $p \in M$, let $(x^i)$ be normal coordinates on a neighborhood
$\mathcal{U}$ of $p$, and let $\gamma$ be a radial geodesic starting at $p$. For any
$W = W^i\partial_i \in T_pM$, the Jacobi field $J$ along $\gamma$ such that $J(0) = 0$ and
$D_tJ(0) = W$ (see Figure 10.4) is given in normal coordinates by the formula

$$J(t) = tW^i\partial_i. \tag{10.4}$$

Proof. An easy computation using formula (4.10) for covariant derivatives
in coordinates shows that $J$ satisfies the specified initial conditions, so it
suffices to show that $J$ is a Jacobi field. If we set $V = \dot\gamma(0) \in T_pM$, then
we know from Lemma 5.11 that $\gamma$ is given in coordinates by the formula
$\gamma(t) = (tV^1, \dots, tV^n)$. Now consider the variation $\Gamma$ given in coordinates
by

$$\Gamma(s,t) = \big(t(V^1 + sW^1), \dots, t(V^n + sW^n)\big).$$

Again using Lemma 5.11, we see that $\Gamma$ is a variation through geodesics.
Therefore its variation field $\partial_s\Gamma(0,t)$ is a Jacobi field. Differentiating $\Gamma(s,t)$
with respect to $s$ shows that its variation field is $J(t)$. □
For metrics with constant sectional curvature, we have a different kind
of explicit formula for Jacobi fields: this one expresses a Jacobi field as a
scalar multiple of a parallel vector field.

Lemma 10.8. Suppose $(M, g)$ is a Riemannian manifold with constant
sectional curvature $C$, and $\gamma$ is a unit speed geodesic in $M$. The normal
Jacobi fields along $\gamma$ vanishing at $t = 0$ are precisely the vector fields

$$J(t) = u(t)E(t), \tag{10.5}$$

where $E$ is any parallel normal vector field along $\gamma$, and $u(t)$ is given by

$$u(t) = \begin{cases} t, & C = 0;\\[2pt] R\sin\dfrac{t}{R}, & C = \dfrac{1}{R^2} > 0;\\[6pt] R\sinh\dfrac{t}{R}, & C = -\dfrac{1}{R^2} < 0. \end{cases} \tag{10.6}$$

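Proof. Since $g$ has constant curvature, its curvature endomorphism is given
by the formula of Lemma 8.10:

$$R(X, Y)Z = C\big(\langle Y, Z\rangle X - \langle X, Z\rangle Y\big).$$

Substituting this into the Jacobi equation, we find that a normal Jacobi
field $J$ satisfies

$$0 = D_t^2 J + C\big(\langle\dot\gamma, \dot\gamma\rangle J - \langle J, \dot\gamma\rangle\dot\gamma\big) = D_t^2 J + CJ, \tag{10.7}$$

where we have used the facts that $|\dot\gamma|^2 = 1$ and $\langle J, \dot\gamma\rangle = 0$.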

Since (10.7) says that the second covariant derivative of $J$ is a multiple
of $J$ itself, it is reasonable to try to construct a solution by choosing a
parallel normal vector field $E$ along $\gamma$ and setting $J(t) = u(t)E(t)$ for some
function $u$ to be determined. Plugging this into (10.7), we find that $J$ is a
Jacobi field provided $u$ is a solution to the differential equation

$$\ddot u(t) + Cu(t) = 0.$$

It is an easy matter to solve this ODE explicitly. In particular, the solutions
satisfying $u(0) = 0$ are constant multiples of the functions given in (10.6).
This construction yields all the normal Jacobi fields vanishing at 0, since
there is an $(n - 1)$-dimensional space of them, and the space of parallel
normal vector fields has the same dimension. □
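The scalar equation $\ddot u + Cu = 0$ with $u(0) = 0$, $\dot u(0) = 1$ is easy to check numerically against the closed forms in (10.6). The following is only an illustrative sketch: the function names `u_closed` and `u_numeric`, the step count, and the choice of a classical Runge-Kutta integrator are assumptions made here, not part of the text.

```python
import math

def u_closed(t, C):
    """Closed-form solution of u'' + C u = 0 with u(0) = 0, u'(0) = 1, as in (10.6)."""
    if C == 0:
        return t
    R = 1.0 / math.sqrt(abs(C))
    return R * math.sin(t / R) if C > 0 else R * math.sinh(t / R)

def u_numeric(t_end, C, steps=2000):
    """Integrate u'' + C u = 0 by classical RK4, as the first-order system (u, w)' = (w, -C u)."""
    h = t_end / steps
    u, w = 0.0, 1.0  # initial conditions u(0) = 0, u'(0) = 1
    for _ in range(steps):
        k1u, k1w = w, -C * u
        k2u, k2w = w + 0.5 * h * k1w, -C * (u + 0.5 * h * k1u)
        k3u, k3w = w + 0.5 * h * k2w, -C * (u + 0.5 * h * k2u)
        k4u, k4w = w + h * k3w, -C * (u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6
        w += h * (k1w + 2 * k2w + 2 * k3w + k4w) / 6
    return u
```

For instance, with $C = 1/4$ (so $R = 2$) the closed form vanishes again at $t = \pi R = 2\pi$, while for $C \le 0$ it has no positive zero.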

Combining the formulas in the last two lemmas, we obtain our first application
of Jacobi fields: explicit expressions for constant curvature metrics
in normal coordinates.

FIGURE 10.5. A vector $X$ tangent to a geodesic sphere is the value of a
normal Jacobi field.


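Proposition 10.9. Suppose $(M, g)$ is a Riemannian manifold with constant
sectional curvature $C$. Let $(x^i)$ be Riemannian normal coordinates on
a normal neighborhood $\mathcal{U}$ of $p \in M$, let $|\cdot|_{\bar g}$ be the Euclidean norm in these
coordinates, and let $r$ be the radial distance function. For any $q \in \mathcal{U} - \{p\}$
and $V \in T_qM$, write $V = V^\top + V^\perp$, where $V^\top$ is tangent to the sphere
$\{r = \text{constant}\}$ through $q$ and $V^\perp$ is a multiple of $\partial/\partial r$. The metric $g$ can
be written

$$g(V,V) = \begin{cases} |V^\perp|_{\bar g}^2 + |V^\top|_{\bar g}^2, & C = 0;\\[4pt] |V^\perp|_{\bar g}^2 + \dfrac{R^2}{r^2}\Big(\sin^2\dfrac{r}{R}\Big)|V^\top|_{\bar g}^2, & C = \dfrac{1}{R^2} > 0;\\[8pt] |V^\perp|_{\bar g}^2 + \dfrac{R^2}{r^2}\Big(\sinh^2\dfrac{r}{R}\Big)|V^\top|_{\bar g}^2, & C = -\dfrac{1}{R^2} < 0. \end{cases} \tag{10.8}$$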

Proof. By the Gauss lemma, the decomposition $V = V^\top + V^\perp$ is orthogonal,
so $|V|_g^2 = |V^\top|_g^2 + |V^\perp|_g^2$. Since $\partial/\partial r$ is a unit vector in both the $g$ and
$\bar g$ norms, it is immediate that $|V^\perp|_g = |V^\perp|_{\bar g}$. Thus we need only compute
$|V^\top|_g$.

Set $X = V^\top$, and let $\gamma$ denote the unit speed radial geodesic from $p$ to
$q$. By Lemma 10.7, $X$ is the value of a Jacobi field $J$ along $\gamma$ that vanishes
at $p$ (Figure 10.5), namely $X = J(r)$, where $r = d(p, q)$ and

$$J(t) = \frac{t}{r}X^i\partial_i. \tag{10.9}$$

Because $J$ is orthogonal to $\dot\gamma$ at $p$ and $q$, it is normal by Lemma 10.6.

Now $J$ can also be written in the form $J(t) = u(t)E(t)$ as in Lemma
10.8. In this representation,

$$D_tJ(0) = \dot u(0)E(0) = E(0),$$

since $\dot u(0) = 1$ in each of the cases of (10.6). Therefore, since $E$ is parallel
and thus of constant length,

$$|X|^2 = |J(r)|^2 = |u(r)|^2|E(r)|^2 = |u(r)|^2|E(0)|^2 = |u(r)|^2|D_tJ(0)|^2. \tag{10.10}$$

Observe that $D_tJ(0) = (1/r)X^i\partial_i|_p$ by (10.9). Since $g$ agrees with $\bar g$ at $p$,
we have

$$|D_tJ(0)| = \frac{1}{r}\Big|X^i\partial_i|_p\Big|_{\bar g} = \frac{1}{r}|X|_{\bar g}.$$

Inserting this into (10.10) and using formula (10.6) for $u(r)$ completes the
proof. □


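Proposition 10.10. (Local Uniqueness of Constant Curvature
Metrics) Let $(M, g)$ and $(\tilde M, \tilde g)$ be Riemannian manifolds with constant
sectional curvature $C$. For any points $p \in M$, $\tilde p \in \tilde M$, there exist neighborhoods
$\mathcal{U}$ of $p$ and $\tilde{\mathcal{U}}$ of $\tilde p$ and an isometry $F\colon \mathcal{U} \to \tilde{\mathcal{U}}$.

Proof. Choose $p \in M$ and $\tilde p \in \tilde M$, and let $\mathcal{U}$ and $\tilde{\mathcal{U}}$ be geodesic balls of small
radius $\varepsilon$ around $p$ and $\tilde p$, respectively. Riemannian normal coordinates give
maps $\varphi\colon \mathcal{U} \to B_\varepsilon(0) \subset \mathbf{R}^n$ and $\tilde\varphi\colon \tilde{\mathcal{U}} \to B_\varepsilon(0) \subset \mathbf{R}^n$, under which both
metrics are given by (10.8) (Figure 10.6). Therefore $\tilde\varphi^{-1} \circ \varphi$ is the required
local isometry. □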


Conjugate Points 

Our next application of Jacobi fields is to study the question of when
the exponential map is a local diffeomorphism. If $(M, g)$ is complete, we
know that $\exp_p$ is defined on all of $T_pM$, and is a local diffeomorphism
near 0. However, it may well happen that it ceases to be even a local
diffeomorphism at points far away.

An enlightening example is provided by the sphere $S^n_R$. All geodesics
starting at a given point $p$ meet at the antipodal point, which is at a distance
of $\pi R$ along each geodesic. The exponential map is a diffeomorphism
on the ball $B_{\pi R}(0)$, but it fails to be a local diffeomorphism at all points
on the sphere of radius $\pi R$ in $T_pS^n_R$ (Figure 10.7). Moreover, Lemma 10.8
shows that each Jacobi field on $S^n_R$ vanishing at $p$ has its first zero precisely
at distance $\pi R$.
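The collapse of the sphere of radius $\pi R$ in $T_pS^2_R$ to the antipodal point can be seen concretely. The sketch below assumes the standard embedding of $S^2_R$ in $\mathbf{R}^3$ and the well-known closed form $\exp_p(V) = \cos(|V|/R)\,p + R\sin(|V|/R)\,V/|V|$ for a tangent vector $V \perp p$; the helper name `exp_sphere` is hypothetical and introduced only for illustration.

```python
import math

def exp_sphere(p, v, R):
    """exp_p(v) on the round sphere of radius R in R^3 (p on the sphere, v tangent at p)."""
    norm_v = math.sqrt(sum(c * c for c in v))
    if norm_v == 0.0:
        return tuple(p)
    a = norm_v / R  # angle swept out along the great circle
    return tuple(math.cos(a) * pc + R * math.sin(a) * vc / norm_v
                 for pc, vc in zip(p, v))

# Every tangent vector of length pi*R at the north pole lands on the south pole.
R = 2.0
north = (0.0, 0.0, R)
images = [exp_sphere(north, (math.pi * R * math.cos(th), math.pi * R * math.sin(th), 0.0), R)
          for th in (0.0, 1.0, 2.5, 4.0)]
```

Since a whole $(n-1)$-sphere of tangent vectors is sent to a single point, the differential of $\exp_p$ cannot be injective there.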

On the other hand, formula (10.4) shows that if $\mathcal{U}$ is a normal neighborhood
of $p$ (the image of a set on which $\exp_p$ is a diffeomorphism), no
nontrivial Jacobi field that vanishes at $p$ can vanish at any other point in $\mathcal{U}$. We
might thus be led to expect a relationship between zeros of Jacobi fields
and singularities of the exponential map (i.e., points where it fails to be a
local diffeomorphism).

FIGURE 10.6. Local isometry constructed from normal coordinate
charts.

If $\gamma$ is a geodesic segment joining $p, q \in M$, $q$ is said to be conjugate to
$p$ along $\gamma$ if there is a Jacobi field along $\gamma$ vanishing at $p$ and $q$ but not
identically zero (Figure 10.8). The order or multiplicity of conjugacy is the
dimension of the space of Jacobi fields vanishing at $p$ and $q$. From the
existence and uniqueness theorem for Jacobi fields, there is an $n$-dimensional
space of Jacobi fields that vanish at $p$; since nontrivial tangential Jacobi fields vanish
at most at one point, the order of conjugacy of two points $p$ and $q$ can be
at most $n - 1$. This bound is sharp: Lemma 10.8 shows that if $p$ and $q$ are
antipodal points on $S^n_R$, there is a Jacobi field vanishing at $p$ and $q$ for each
parallel normal vector field along $\gamma$; thus in that case $p$ and $q$ are conjugate
to order exactly $n - 1$.

The most important fact about conjugate points is that they are precisely
the images of singularities of the exponential map, as the following
proposition shows.


Proposition 10.11. Suppose $p \in M$, $V \in T_pM$, and $q = \exp_p V$. Then
$\exp_p$ is a local diffeomorphism in a neighborhood of $V$ if and only if $q$ is
not conjugate to $p$ along the geodesic $\gamma(t) = \exp_p(tV)$, $t \in [0, 1]$.

FIGURE 10.7. The exponential map of the sphere.



Proof. By the inverse function theorem, $\exp_p$ is a local diffeomorphism
near $V$ if and only if $(\exp_p)_*$ is an isomorphism at $V$, and by dimensional
considerations, this occurs if and only if $(\exp_p)_*$ is injective at $V$.

Identifying $T_V(T_pM)$ with $T_pM$ as usual, we can compute the push-forward
$(\exp_p)_*$ at $V$ as follows:

$$(\exp_p)_*W = \frac{d}{ds}\bigg|_{s=0}\exp_p(V + sW).$$

To compute this, we define a variation of $\gamma$ through geodesics (Figure 10.9)
by

$$\Gamma_W(s, t) = \exp_p t(V + sW).$$

FIGURE 10.9. Computing $(\exp_p)_*W$.

Then the variation field $J_W(t) = \partial_s\Gamma_W(0, t)$ is a Jacobi field along $\gamma$, and

$$J_W(1) = (\exp_p)_*W.$$

Since $W \in T_pM$ is arbitrary, there is an $n$-dimensional space of such Jacobi
fields, and so these are all the Jacobi fields along $\gamma$ that vanish at $p$. (If $\gamma$
is contained in a normal neighborhood, these are just the Jacobi fields of
the form (10.4) in normal coordinates.)

Therefore, $(\exp_p)_*$ fails to be an isomorphism at $V$ when there is a nonzero vector
$W$ such that $(\exp_p)_*W = 0$, which occurs precisely when there is a nontrivial Jacobi
field $J_W$ along $\gamma$ with $J_W(0) = J_W(1) = 0$. □

As Proposition 10.4 shows, the "natural" way to specify a unique Jacobi
field is by giving its initial value and initial derivative. However, in a number
of the arguments above, we have had to construct Jacobi fields along a
geodesic $\gamma$ satisfying $J(0) = 0$ and $J(b) = W$ for some specific vector $W$.
More generally, one can pose the two-point boundary problem for Jacobi
fields: Given $V \in T_{\gamma(a)}M$ and $W \in T_{\gamma(b)}M$, find a Jacobi field $J$ along
$\gamma$ such that $J(a) = V$ and $J(b) = W$. Another interesting property of
conjugate points is that they are the obstruction to solving the two-point
boundary problem, as the next exercise shows.

Exercise 10.2. Suppose $\gamma\colon [a, b] \to M$ is a geodesic. Show that the two-point
boundary problem for Jacobi fields is uniquely solvable for every pair
of vectors $V \in T_{\gamma(a)}M$ and $W \in T_{\gamma(b)}M$ if and only if $\gamma(a)$ and $\gamma(b)$ are not
conjugate along $\gamma$.



The Second Variation Formula 

Our last task in this chapter is to study the question of which geodesics
are minimizing. In our proof that any minimizing curve is a geodesic, we
imitated the first-derivative test of elementary calculus: If a geodesic $\gamma$ is
minimizing, then the first derivative of the length functional must vanish
for any proper variation of $\gamma$. Now we imitate the second-derivative test: If
$\gamma$ is minimizing, the second derivative must be nonnegative. First, we must
compute this second derivative. In keeping with classical terminology, we
call it the second variation of the length functional.

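Theorem 10.12. (The Second Variation Formula) Let $\gamma\colon [a, b] \to M$
be a unit speed geodesic, $\Gamma$ a proper variation of $\gamma$, and $V$ its variation field.
The second variation of $L(\Gamma_s)$ is given by the following formula:

$$\frac{d^2}{ds^2}\bigg|_{s=0} L(\Gamma_s) = \int_a^b \Big(|D_tV^\perp|^2 - Rm(V^\perp, \dot\gamma, \dot\gamma, V^\perp)\Big)\,dt, \tag{10.11}$$

where $V^\perp$ is the normal component of $V$.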


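Proof. As usual, write $T = \partial_t\Gamma$ and $S = \partial_s\Gamma$. We begin, as we did when
computing the first variation formula, by restricting to a rectangle $(-\varepsilon, \varepsilon) \times
[a_{i-1}, a_i]$ where $\Gamma$ is smooth. From (6.3) we have, for any $s$,

$$\frac{d}{ds}L\big(\Gamma_s|_{[a_{i-1},a_i]}\big) = \int_{a_{i-1}}^{a_i} \frac{\langle D_tS, T\rangle}{\langle T, T\rangle^{1/2}}\,dt.$$

Differentiating again with respect to $s$, and using the symmetry lemma and
Lemma 10.1,

$$\begin{aligned}
\frac{d^2}{ds^2}L\big(\Gamma_s|_{[a_{i-1},a_i]}\big)
&= \int_{a_{i-1}}^{a_i}\left(\frac{\langle D_sD_tS, T\rangle}{\langle T,T\rangle^{1/2}} + \frac{\langle D_tS, D_sT\rangle}{\langle T,T\rangle^{1/2}} - \frac{1}{2}\,\frac{\langle D_tS, T\rangle\, 2\langle D_sT, T\rangle}{\langle T,T\rangle^{3/2}}\right)dt\\
&= \int_{a_{i-1}}^{a_i}\left(\frac{\langle D_tD_sS + R(S,T)S,\, T\rangle}{|T|} + \frac{\langle D_tS, D_tS\rangle}{|T|} - \frac{\langle D_tS, T\rangle^2}{|T|^3}\right)dt.
\end{aligned}$$

Now restrict to $s = 0$, where $|T| = 1$:

$$\frac{d^2}{ds^2}\bigg|_{s=0}L\big(\Gamma_s|_{[a_{i-1},a_i]}\big) = \int_{a_{i-1}}^{a_i}\Big(\langle D_tD_sS, T\rangle - Rm(S,T,T,S) + |D_tS|^2 - \langle D_tS, T\rangle^2\Big)\,dt\,\bigg|_{s=0}. \tag{10.12}$$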
Because $D_tT = D_t\dot\gamma = 0$ when $s = 0$, the first term in (10.12) can be
integrated as follows:

$$\int_{a_{i-1}}^{a_i}\langle D_tD_sS, T\rangle\,dt = \int_{a_{i-1}}^{a_i}\frac{\partial}{\partial t}\langle D_sS, T\rangle\,dt = \langle D_sS, T\rangle\Big|_{t=a_{i-1}}^{t=a_i}. \tag{10.13}$$

Notice that $S(s, t) = 0$ for all $s$ at the endpoints $t = a_0 = a$ and $t = a_k = b$
because $\Gamma$ is a proper variation, so $D_sS = 0$ there. Moreover, along the
boundaries $\{t = a_i\}$ of the smooth regions, $D_sS = D_s(\partial_s\Gamma)$ depends only
on the values of $\Gamma$ when $t = a_i$, and it is smooth up to the line $\{t = a_i\}$
from both sides; therefore $D_sS$ is continuous for all $(s, t)$. Thus when we
insert (10.13) into (10.12) and sum over $i$, the boundary contributions from
the first term all cancel, and we get

$$\begin{aligned}
\frac{d^2}{ds^2}\bigg|_{s=0}L(\Gamma_s)
&= \int_a^b\Big(|D_tS|^2 - \langle D_tS, T\rangle^2 - Rm(S,T,T,S)\Big)\,dt\,\bigg|_{s=0}\\
&= \int_a^b\Big(|D_tV|^2 - \langle D_tV, \dot\gamma\rangle^2 - Rm(V,\dot\gamma,\dot\gamma,V)\Big)\,dt.
\end{aligned} \tag{10.14}$$


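Any vector field $V$ along $\gamma$ can be written uniquely as $V = V^\top + V^\perp$,
where $V^\top$ is tangential and $V^\perp$ is normal. Explicitly,

$$V^\top = \langle V, \dot\gamma\rangle\dot\gamma; \qquad V^\perp = V - V^\top.$$

Because $D_t\dot\gamma = 0$, it follows that

$$D_tV^\top = \langle D_tV, \dot\gamma\rangle\dot\gamma = (D_tV)^\top; \qquad D_tV^\perp = (D_tV)^\perp.$$

Therefore,

$$|D_tV|^2 = |(D_tV)^\top|^2 + |(D_tV)^\perp|^2 = \langle D_tV, \dot\gamma\rangle^2 + |D_tV^\perp|^2.$$

Also,

$$Rm(V, \dot\gamma, \dot\gamma, V) = Rm(V^\perp, \dot\gamma, \dot\gamma, V^\perp)$$

because $Rm(\dot\gamma, \dot\gamma, \cdot, \cdot) = Rm(\cdot, \cdot, \dot\gamma, \dot\gamma) = 0$. Substituting these relations into
(10.14) gives (10.11). □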

It should come as no surprise that the second variation depends only
on the normal component of $V$; intuitively, the tangential component of $V$
contributes only to a reparametrization of $\gamma$, and length is independent of
parametrization. For this reason, we generally apply the second variation
formula only to variations whose variation fields are proper and normal.

We define a symmetric bilinear form $I$, called the index form, on the
space of proper normal vector fields along $\gamma$ by

$$I(V, W) = \int_a^b \Big(\langle D_tV, D_tW\rangle - Rm(V, \dot\gamma, \dot\gamma, W)\Big)\,dt. \tag{10.15}$$

You should think of $I(V, W)$ as a sort of "Hessian" or second derivative of
the length functional. Because every proper normal vector field along $\gamma$ is
the variation field of some proper variation, the preceding theorem can be
rephrased in terms of the index form in the following way.

Corollary 10.13. If $\Gamma$ is a proper variation of a unit speed geodesic $\gamma$
whose variation field is a proper normal vector field $V$, the second variation
of $L(\Gamma_s)$ is $I(V, V)$. In particular, if $\gamma$ is minimizing, then $I(V, V) \ge 0$ for
any proper normal vector field $V$ along $\gamma$.

The next proposition gives another expression for $I$, which makes the
role of the Jacobi equation more evident.

Proposition 10.14. For any pair of proper normal vector fields $V$, $W$
along a geodesic segment $\gamma$,

$$I(V, W) = -\int_a^b \big\langle D_t^2V + R(V, \dot\gamma)\dot\gamma,\, W\big\rangle\,dt - \sum_{i=1}^{k-1}\big\langle \Delta_iD_tV,\, W(a_i)\big\rangle, \tag{10.16}$$

where $\{a_i\}$ are the points where $V$ is not smooth, and $\Delta_iD_tV$ is the jump
in $D_tV$ at $t = a_i$.

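Proof. On any subinterval $[a_{i-1}, a_i]$ where $V$ and $W$ are smooth,

$$\frac{d}{dt}\langle D_tV, W\rangle = \langle D_t^2V, W\rangle + \langle D_tV, D_tW\rangle.$$

Thus, by the fundamental theorem of calculus,

$$\int_{a_{i-1}}^{a_i}\langle D_tV, D_tW\rangle\,dt = -\int_{a_{i-1}}^{a_i}\langle D_t^2V, W\rangle\,dt + \langle D_tV, W\rangle\Big|_{a_{i-1}}^{a_i}.$$

Summing over $i$, and noting that $W$ is continuous at $t = a_i$ and $W(a) =
W(b) = 0$, we get (10.16). □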


Geodesics Do Not Minimize Past Conjugate Points 

In this section, we use the second variation to prove another extremely
important fact about conjugate points: No geodesic is minimizing past its
first conjugate point. The geometric intuition is as follows: Suppose $\gamma$ is
minimizing. If $q = \gamma(b)$ is conjugate to $p = \gamma(a)$ along $\gamma$, and $J$ is a Jacobi
field vanishing at $p$ and $q$, there is a variation of $\gamma$ through geodesics, all
of which start at $p$. Since $J$ vanishes at $q$, we can expect them to end "almost"
at $q$. If they really did all end at $q$, we could construct a broken geodesic
by following some $\Gamma_s$ from $p$ to $q$ and then following $\gamma$ from $q$ to $\gamma(b + \varepsilon)$,
which would have the same length and thus would also be a minimizing
curve. But this is impossible: as the proof of Theorem 6.6 shows, a broken
geodesic can always be shortened by rounding the corner.

FIGURE 10.10. Constructing a vector field $X$ with $I(X, X) < 0$.

The problem with this heuristic argument is that there is no guarantee 
that we can construct a variation through geodesics that actually end at q. 
The proof of the following theorem is based on an “infinitesimal” version 
of rounding the corner to obtain a shorter curve. 

Theorem 10.15. If $\gamma$ is a geodesic segment from $p$ to $q$ that has an interior
conjugate point to $p$, then there exists a proper normal vector field $X$
along $\gamma$ such that $I(X, X) < 0$. In particular, $\gamma$ is not minimizing.

Proof. Suppose $\gamma\colon [0, b] \to M$ is a unit speed parametrization of $\gamma$, and $\gamma(a)$
is conjugate to $\gamma(0)$ for some $0 < a < b$. This means there is a nontrivial
normal Jacobi field $J$ along $\gamma|_{[0,a]}$ that vanishes at $t = 0$ and $t = a$. Define
a vector field $V$ along all of $\gamma$ by

$$V(t) = \begin{cases} J(t), & t \in [0, a];\\ 0, & t \in [a, b]. \end{cases}$$

This is a proper, normal, piecewise smooth vector field along $\gamma$.

Let $W$ be a smooth proper normal vector field along $\gamma$ such that $W(a)$
is equal to the jump $\Delta D_tV$ at $t = a$ (Figure 10.10). Such a vector field is
easily constructed in local coordinates and extended to all of $\gamma$ by a bump
function. Note that $\Delta D_tV = -D_tJ(a)$ is not zero, because otherwise $J$
would be a Jacobi field satisfying $J(a) = D_tJ(a) = 0$, and thus would be
identically zero.

FIGURE 10.11. Geodesics on the cylinder.


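For small positive $\varepsilon$, let $X_\varepsilon = V + \varepsilon W$. Then

$$I(X_\varepsilon, X_\varepsilon) = I(V + \varepsilon W, V + \varepsilon W) = I(V, V) + 2\varepsilon I(V, W) + \varepsilon^2 I(W, W).$$

Since $V$ satisfies the Jacobi equation on each subinterval $[0, a]$ and $[a, b]$,
and $V(a) = 0$, (10.16) gives

$$I(V, V) = -\langle \Delta D_tV, V(a)\rangle = 0.$$

Similarly,

$$I(V, W) = -\langle \Delta D_tV, W(a)\rangle = -|W(a)|^2.$$

Thus

$$I(X_\varepsilon, X_\varepsilon) = -2\varepsilon|W(a)|^2 + \varepsilon^2 I(W, W).$$

If we choose $\varepsilon$ small enough, this is strictly negative. □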
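A concrete instance on the unit sphere may help fix ideas. The sketch below does not use the field $X_\varepsilon$ of the proof; it assumes instead the simpler test field $X(t) = \sin(\pi t/b)E(t)$ along a unit speed great circle arc of length $b$ on $S^2$, with $E$ a parallel unit normal field. With $C = 1$, the formula of Lemma 8.10 gives $Rm(X, \dot\gamma, \dot\gamma, X) = |X|^2$, so (10.15) reduces to $I(X, X) = \int_0^b (\dot f^2 - f^2)\,dt$ with $f(t) = \sin(\pi t/b)$, whose exact value is $(\pi^2 - b^2)/(2b)$. The function name and the midpoint quadrature are choices made here for illustration.

```python
import math

def index_form_sphere(b, n=20000):
    """Midpoint-rule value of I(X, X) = integral over [0, b] of f'(t)^2 - f(t)^2,
    where f(t) = sin(pi t / b) (unit sphere, X = f E with E parallel, unit, normal)."""
    h = b / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * h
        f = math.sin(math.pi * t / b)
        df = (math.pi / b) * math.cos(math.pi * t / b)
        total += (df * df - f * f) * h
    return total

def index_form_exact(b):
    """Closed form (pi^2 - b^2) / (2 b) of the same integral."""
    return (math.pi ** 2 - b ** 2) / (2 * b)
```

The sign changes exactly at $b = \pi$, the distance to the first conjugate point on the unit sphere: for $b > \pi$ the index form is negative on this field, in agreement with the theorem.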

There is a far-reaching quantitative generalization of Theorem 10.15
called the Morse index theorem, which we do not treat here. The index
of a geodesic segment is defined to be the maximum dimension of a linear
space of proper normal vector fields on which $I$ is negative definite.
Roughly speaking, the index is the number of independent directions in
which $\gamma$ can be deformed to decrease its length. (Analogously, the index of
a critical point of a function on $\mathbf{R}^n$ is defined as the number of negative
eigenvalues of its Hessian.) The Morse index theorem says that the index
of any geodesic segment is finite, and is equal to the number of its interior
conjugate points counted with multiplicity. (Proofs can be found in [CE75],
[dC92], or [Spi79, volume 4].)
It is important to note, by the way, that the converse of Theorem 10.15 is
not true: a geodesic without conjugate points need not be minimizing. For
example, on the cylinder $S^1 \times \mathbf{R}$, there are no conjugate points along any
geodesic; but no geodesic that wraps more than halfway around the cylinder
is minimizing (Figure 10.11). Therefore it is useful to make the following
definitions. Suppose $\gamma$ is a geodesic starting at $p$. Let $B = \sup\{b > 0 :
\gamma|_{[0,b]}$ is minimizing$\}$. If $B < \infty$, we call $q = \gamma(B)$ the cut point of $p$ along
$\gamma$. The cut locus of $p$ is the set of all points $q \in M$ such that $q$ is the cut
point of $p$ along some geodesic. (Analogously, the conjugate locus of $p$ is
the set of points $q$ such that $q$ is the first conjugate point to $p$ along some
geodesic.) The preceding theorem can be interpreted as saying that the cut
point (if it exists) occurs at or before the first conjugate point along any
geodesic.
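The cylinder example can be made quantitative. The sketch below assumes the unit cylinder $S^1 \times \mathbf{R}$ with coordinates $(\theta, z)$ and the flat product metric, so that the unit speed geodesic from $(0, 0)$ with direction angle $\alpha$ is $t \mapsto (t\cos\alpha,\ t\sin\alpha)$, with $\theta$ read modulo $2\pi$. The function names and the tolerance are hypothetical choices for illustration.

```python
import math

def cylinder_distance(theta, z):
    """Distance on the unit cylinder S^1 x R from (0, 0) to (theta, z)."""
    wrapped = math.fmod(abs(theta), 2 * math.pi)
    short = min(wrapped, 2 * math.pi - wrapped)  # go around the shorter way
    return math.hypot(short, z)

def is_minimizing(t, alpha):
    """Does the unit speed geodesic s -> (s cos(alpha), s sin(alpha)) minimize on [0, t]?"""
    d = cylinder_distance(t * math.cos(alpha), t * math.sin(alpha))
    return abs(d - t) < 1e-12

def cut_time(alpha):
    """The number B of the text: the geodesic stops minimizing once t |cos(alpha)| exceeds pi."""
    c = abs(math.cos(alpha))
    return math.inf if c < 1e-15 else math.pi / c
```

For $\alpha = \pi/3$ the cut time is $2\pi$: the geodesic minimizes up to $t = 6$ but not up to $t = 6.5$, even though there are no conjugate points at all.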
Problems

10-1. Extend the result of Lemma 10.8 by finding a formula for all normal
Jacobi fields in the constant curvature case, not just the ones that
vanish at 0.

10-2. Suppose that all sectional curvatures of $M$ are nonpositive. Use the
results of this chapter to show that the conjugate locus of any point
is empty. [We will give a more geometric proof in the next chapter.]

10-3. Suppose $(M, g)$ is a Riemannian manifold and $p \in M$. Show that the
second-order Taylor series of $g$ in normal coordinates centered at $p$ is

$$g_{ij}(x) = \delta_{ij} - \frac{1}{3}\sum_{kl}R_{iklj}\,x^kx^l + O(|x|^3).$$

[Hint: Let $\gamma(t) = (tV^1, \dots, tV^n)$ be a radial geodesic and $J(t) =
tW^i\partial_i$ a Jacobi field along $\gamma$, and compute the first four $t$-derivatives
of $|J(t)|^2$ at $t = 0$ in two ways.]





11 

Curvature and Topology 


In this final chapter, we bring together most of the tools we have developed
so far to prove some significant local-global theorems relating curvature
and topology. Before treating the topological theorems themselves, we
prove some comparison theorems for manifolds whose curvature is bounded
above. These comparisons are based on a simple ODE comparison theorem
due to Sturm, and show that if the curvature is bounded above by a
constant, then the metric in normal coordinates is bounded below by the
corresponding constant curvature metric.

We then state and prove several of the most important local-global theorems
of Riemannian geometry. The first one, the Cartan-Hadamard theorem,
topologically characterizes complete, simply-connected manifolds with
nonpositive sectional curvature: they are all diffeomorphic to $\mathbf{R}^n$. The second,
Bonnet's theorem, says that a complete manifold with sectional curvatures
bounded below by a positive constant must be compact and have
a finite fundamental group; a generalization called Myers's theorem allows
positive sectional curvature to be replaced by positive Ricci curvature. The
last theorem in this chapter says that complete manifolds with constant
sectional curvature are all quotients of the model spaces by discrete subgroups
of their isometry groups.
Some Comparison Theorems

We begin this chapter by proving that an upper bound on sectional curvature
produces a lower bound on Jacobi fields, on the distance to conjugate
points, and on the metric in normal coordinates. Our starting point is the
following very classical comparison theorem for ordinary differential equations.
This result can be found in various guises in the literature (cf., for
example, [Spi79, volume 4] or [BR78, Theorem II.6]), but all are essentially
equivalent to the one presented here.

Theorem 11.1. (Sturm Comparison Theorem) Suppose $u$ and $v$ are
differentiable real-valued functions on $[0, T]$, twice differentiable on $(0, T)$,
and $u > 0$ on $(0, T)$. Suppose further that $u$ and $v$ satisfy

$$\begin{aligned}
\ddot u(t) + a(t)u(t) &= 0,\\
\ddot v(t) + a(t)v(t) &\ge 0,\\
u(0) = v(0) = 0, \quad \dot u(0) &= \dot v(0) > 0
\end{aligned}$$

for some function $a\colon [0, T] \to \mathbf{R}$. Then $v(t) \ge u(t)$ on $[0, T]$.

Proof. Consider the function $f(t) = v(t)/u(t)$ defined on $(0, T)$. It follows
from l'Hôpital's rule that $\lim_{t\to 0} f(t) = \dot v(0)/\dot u(0) = 1$. Since $f$ is differentiable
on $(0, T)$, if we could show that $\dot f \ge 0$ there it would follow from
elementary calculus that $f \ge 1$ and therefore $v \ge u$ on $(0, T)$, and by
continuity also on $[0, T]$. Differentiating,

$$\frac{d}{dt}\Big(\frac{v}{u}\Big) = \frac{\dot vu - v\dot u}{u^2}.$$

Thus to show $\dot f \ge 0$ it would suffice to show $\dot vu - v\dot u \ge 0$. Since $\dot v(0)u(0) -
v(0)\dot u(0) = 0$, we need only show this expression has nonnegative derivative.
Differentiating again and substituting the ODE for $u$,

$$\frac{d}{dt}(\dot vu - v\dot u) = \ddot vu + \dot v\dot u - \dot v\dot u - v\ddot u = \ddot vu + avu \ge 0.$$

This proves the theorem. □
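A simple instance shows the mechanism of the proof. Take $a \equiv 1$ on $[0, \pi]$, $u(t) = \sin t$ and $v(t) = t$; then $\ddot u + u = 0$, $\ddot v + v = t \ge 0$, and the initial conditions match. The sketch below (function name and sample count are arbitrary choices made here) samples the ratio $f = v/u$ on $(0, \pi)$ so that one can observe that it is at least 1 and nondecreasing, which is what the argument predicts.

```python
import math

def sturm_ratio_samples(T=math.pi, n=1000):
    """Sample f = v/u at n interior points of (0, T), for u(t) = sin t and v(t) = t.
    Both satisfy the hypotheses of the Sturm comparison theorem with a(t) = 1."""
    ts = [T * k / (n + 1) for k in range(1, n + 1)]
    return [(t, t / math.sin(t)) for t in ts]

samples = sturm_ratio_samples()
ratios = [f for _, f in samples]
```

Here the conclusion $v \ge u$ is just the familiar inequality $t \ge \sin t$ on $[0, \pi]$.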


Theorem 11.2. (Jacobi Field Comparison Theorem) Suppose $(M, g)$
is a Riemannian manifold with all sectional curvatures bounded above by a
constant $C$. If $\gamma$ is a unit speed geodesic in $M$, and $J$ is any normal Jacobi
field along $\gamma$ such that $J(0) = 0$, then

$$|J(t)| \ge \left\{\begin{array}{lll}
t\,|D_tJ(0)| & \text{for } 0 \le t, & \text{if } C = 0;\\[4pt]
R\sin\dfrac{t}{R}\,|D_tJ(0)| & \text{for } 0 \le t \le \pi R, & \text{if } C = \dfrac{1}{R^2} > 0;\\[8pt]
R\sinh\dfrac{t}{R}\,|D_tJ(0)| & \text{for } 0 \le t, & \text{if } C = -\dfrac{1}{R^2} < 0.
\end{array}\right.$$


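Proof. The function $|J(t)|$ is smooth wherever $J(t) \ne 0$. Using the Jacobi
equation, we compute

$$\begin{aligned}
\frac{d^2}{dt^2}|J| &= \frac{d}{dt}\,\frac{\langle D_tJ, J\rangle}{\langle J, J\rangle^{1/2}}\\
&= \frac{\langle D_t^2J, J\rangle}{\langle J, J\rangle^{1/2}} + \frac{\langle D_tJ, D_tJ\rangle}{\langle J, J\rangle^{1/2}} - \frac{\langle D_tJ, J\rangle^2}{\langle J, J\rangle^{3/2}}\\
&= -\frac{\langle R(J, \dot\gamma)\dot\gamma, J\rangle}{|J|} + \frac{|D_tJ|^2}{|J|} - \frac{\langle D_tJ, J\rangle^2}{|J|^3}.
\end{aligned}$$

By the Schwarz inequality, $\langle D_tJ, J\rangle^2 \le |D_tJ|^2|J|^2$, so the sum of the last
two terms above is nonnegative. Thus

$$\frac{d^2}{dt^2}|J| \ge -\frac{Rm(J, \dot\gamma, \dot\gamma, J)}{|J|}.$$

Since $\langle J, \dot\gamma\rangle = 0$ and $|\dot\gamma| = 1$, $Rm(J, \dot\gamma, \dot\gamma, J)/|J|^2$ is the sectional curvature
of the plane spanned by $J$ and $\dot\gamma$. Therefore our assumption on the sectional
curvatures of $M$ guarantees that $Rm(J, \dot\gamma, \dot\gamma, J)/|J|^2 \le C$, so $|J|$ satisfies
the differential inequality

$$\frac{d^2}{dt^2}|J| \ge -C|J|$$

wherever $|J| > 0$.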

We wish to use the Sturm comparison theorem to compare $|J|$ with the
solution $u$ to $\ddot u + Cu = 0$ given by (10.6). To do so, we need to arrange
that $d|J|/dt = 1$ at $t = 0$, because $\dot u(0) = 1$. Multiplying $J$ by a positive
constant, we may assume without loss of generality that $|D_tJ(0)| = 1$.
From Lemma 10.7, $J$ can be written near $t = 0$ as $J(t) = tW(t)$, where
$W$ is a smooth vector field. (It is the one given in normal coordinates by
$W(t) = W^i\partial_i$ for some constants $W^1, \dots, W^n$, but that is irrelevant here.)
Therefore,

$$\frac{d}{dt}\bigg|_{t=0}|J| = \lim_{t\searrow 0}\frac{|J(t)| - |J(0)|}{t} = \lim_{t\searrow 0}\frac{t\,|W(t)|}{t} = |W(0)| = |D_tJ(0)| = 1.$$

Now the Sturm comparison theorem applies to show that $|J| \ge u$, provided
$|J|$ is nonzero (to ensure that it is smooth). The fact that $d|J|/dt = 1$
at $t = 0$ means $|J| > 0$ on some interval $(0, \varepsilon)$, and $|J|$ cannot attain its
first zero before $u$ does without contradicting the estimate $|J| \ge u$. Thus
$|J| \ge u$ as long as $u \ge 0$, which proves the theorem. □


Corollary 11.3. (Conjugate Point Comparison Theorem) Suppose
all sectional curvatures of $(M, g)$ are bounded above by a constant $C$. If
$C \le 0$, then no point of $M$ has conjugate points along any geodesic. If
$C = 1/R^2 > 0$, then the first conjugate point along any geodesic occurs at
a distance of at least $\pi R$.

Proof. If $C \le 0$, the Jacobi field comparison theorem implies that any
nontrivial normal Jacobi field vanishing at $t = 0$ satisfies $|J(t)| > 0$ for
all $t > 0$. Similarly, if $C > 0$, then $|J(t)| \ge (\text{constant})\sin(t/R) > 0$ for
$0 < t < \pi R$. □

Corollary 11.4. (Metric Comparison Theorem) Suppose all sectional
curvatures of $(M, g)$ are bounded above by a constant $C$. In any normal
coordinate chart, $g(V, V) \ge g_C(V, V)$, where $g_C$ is the constant curvature
metric given by formula (10.8).

Proof. Decomposing a vector $V$ into components $V^\top$ tangent to the geodesic
sphere and $V^\perp$ tangent to the radial geodesics as in the proof of
Proposition 10.9 gives

$$g(V, V) = g(V^\top, V^\top) + g(V^\perp, V^\perp).$$

Just as in that proof, $g(V^\perp, V^\perp) = \bar g(V^\perp, V^\perp) = g_C(V^\perp, V^\perp)$. Also, $V^\top$
is the value of some normal Jacobi field vanishing at $t = 0$, so the Jacobi
field comparison theorem gives $g(V^\top, V^\top) \ge g_C(V^\top, V^\top)$. □

The general information provided by these results is that a nonpositive 
upper bound on curvature forces geodesics to “spread out,” while a positive 
upper bound prevents them from converging too fast. 


Manifolds of Negative Curvature 

Our first major local-global theorem in arbitrary dimensions is the following
characterization of simply-connected manifolds of nonpositive sectional
curvature.

Theorem 11.5. (The Cartan-Hadamard Theorem) If $M$ is a complete,
connected manifold all of whose sectional curvatures are nonpositive,
then for any point $p \in M$, $\exp_p\colon T_pM \to M$ is a covering map. In particular,
the universal covering space of $M$ is diffeomorphic to $\mathbf{R}^n$. If $M$ is
simply connected, then $M$ itself is diffeomorphic to $\mathbf{R}^n$.

Proof. The assumption of nonpositive curvature guarantees that $p$ has no
conjugate points along any geodesic, which can be shown by using either
the conjugate point comparison theorem above or Problem 10-2. Therefore,
by Proposition 10.11, $\exp_p$ is a local diffeomorphism on all of $T_pM$.

Let $\tilde g$ be the (variable-coefficient) 2-tensor field $\exp_p^* g$ defined on $T_pM$.
Because $(\exp_p)_*$ is everywhere nonsingular, $\tilde g$ is a Riemannian metric, and
$\exp_p\colon (T_pM, \tilde g) \to (M, g)$ is a local isometry. Moreover, $(T_pM, \tilde g)$ is
complete: $\exp_p$ carries each line $t \mapsto tV$ to a geodesic of $M$, so these lines
are $\tilde g$-geodesics through 0 defined for all $t$, and by the Hopf-Rinow theorem
this implies completeness. It then follows from Lemma
11.6 below that $\exp_p$ is a covering map. The remaining statements of
the theorem follow immediately from uniqueness of the universal covering
space. □

FIGURE 11.1. Lifting geodesics.

Lemma 11.6. Suppose $\tilde M$ and $M$ are connected Riemannian manifolds,
with $\tilde M$ complete, and $\pi\colon \tilde M \to M$ is a local isometry. Then $M$ is complete
and $\pi$ is a covering map.

Proof. A fundamental property of covering maps is the path-lifting property:
any continuous path $\gamma$ in $M$ lifts to a path $\tilde\gamma$ in $\tilde M$ such that $\pi \circ \tilde\gamma = \gamma$.
We begin by proving that $\pi$ possesses the path-lifting property for geodesics:
If $p \in M$, $\tilde p \in \pi^{-1}(p)$, and $\gamma\colon I \to M$ is a geodesic starting at $p$, then $\gamma$
has a unique lift starting at $\tilde p$ (Figure 11.1). The lifted curve is necessarily
also a geodesic because $\pi$ is a local isometry.

To prove the path-lifting property for geodesics, let $V = \dot\gamma(0)$ and $\tilde V =
\pi_*^{-1}\dot\gamma(0) \in T_{\tilde p}\tilde M$ (which is well defined because $\pi_*$ is an isomorphism at
each point), and let $\tilde\gamma$ be the geodesic in $\tilde M$ with initial point $\tilde p$ and initial
velocity $\tilde V$. Because $\tilde M$ is complete, $\tilde\gamma$ is defined for all time. Since $\pi$ is a
local isometry, it takes geodesics to geodesics; and since by construction
$\pi(\tilde\gamma(0)) = \gamma(0)$ and $\pi_*\dot{\tilde\gamma}(0) = \dot\gamma(0)$, we must have $\pi \circ \tilde\gamma = \gamma$ on $I$. In
particular, $\pi \circ \tilde\gamma$ is a geodesic defined for all $t$ that coincides with $\gamma$ on $I$,
so $\gamma$ extends to all of $\mathbf{R}$ and thus $M$ is complete.

Next we show that $\pi$ is surjective. Choose some point $\tilde p \in \tilde M$, write $p =
\pi(\tilde p)$, and let $q \in M$ be arbitrary. Because $M$ is connected and complete,
there is a minimizing geodesic segment $\gamma$ from $p$ to $q$. Letting $\tilde\gamma$ be the lift
of $\gamma$ starting at $\tilde p$ and $r = d(p, q)$, we have $\pi(\tilde\gamma(r)) = \gamma(r) = q$, so $q$ is in
the image of $\pi$.

FIGURE 11.2. Proof that $\tilde{\mathcal{U}}_\alpha$ and $\tilde{\mathcal{U}}_\beta$ are disjoint.

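To show that $\pi$ is a covering map, we need to show that every point
$p \in M$ has a neighborhood $\mathcal{U}$ that is evenly covered, which means that
$\pi^{-1}(\mathcal{U})$ is the disjoint union of open sets $\tilde{\mathcal{U}}_\alpha$ such that $\pi\colon \tilde{\mathcal{U}}_\alpha \to \mathcal{U}$ is a
diffeomorphism. We will show, in fact, that any geodesic ball $\mathcal{U} = B_\varepsilon(p)$ is
evenly covered.

Let $\pi^{-1}(p) = \{\tilde p_\alpha\}$, and for each $\alpha$ let $\tilde{\mathcal{U}}_\alpha$ denote the metric ball of radius
$\varepsilon$ around $\tilde p_\alpha$ (we are not claiming that $\tilde{\mathcal{U}}_\alpha$ is a geodesic ball). The first step
is to show that the various sets $\tilde{\mathcal{U}}_\alpha$ are disjoint. For any $\alpha \ne \beta$, there is a
minimizing geodesic $\tilde\gamma$ from $\tilde p_\alpha$ to $\tilde p_\beta$ because $\tilde M$ is complete. The projected
curve $\gamma := \pi \circ \tilde\gamma$ is a geodesic from $p$ to $p$ (Figure 11.2). Such a geodesic
must leave $\mathcal{U}$ and re-enter it (since all geodesics passing through $p$ and lying
in $\mathcal{U}$ are radial line segments), and thus must have length at least $2\varepsilon$. This
means $d(\tilde p_\alpha, \tilde p_\beta) \ge 2\varepsilon$, and thus by the triangle inequality $\tilde{\mathcal{U}}_\alpha \cap \tilde{\mathcal{U}}_\beta = \emptyset$.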

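The next step is to show that $\pi^{-1}(U) = \bigcup_\alpha \tilde U_\alpha$. Since $\pi$ is a local isometry,
it clearly maps $\tilde U_\alpha$ into $U$. Thus we need only show $\pi^{-1}(U) \subset \bigcup_\alpha \tilde U_\alpha$. Let
$\tilde q \in \pi^{-1}(U)$. This means that $q := \pi(\tilde q) \in U$, so there is a minimizing
geodesic $\gamma$ in $U$ from $q$ to $p$, and $r = d(q,p) < \varepsilon$. Letting $\tilde\gamma$ be the lift of
$\gamma$ starting at $\tilde q$, it follows that $\pi(\tilde\gamma(r)) = \gamma(r) = p$ (Figure 11.3). Therefore
$\tilde\gamma(r) = \tilde p_\alpha$ for some $\alpha$, and $d(\tilde q, \tilde p_\alpha) \le L(\tilde\gamma) = r < \varepsilon$, so $\tilde q \in \tilde U_\alpha$.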

FIGURE 11.3. Proof that $\pi^{-1}(U) \subset \bigcup_\alpha \tilde U_\alpha$.


It remains only to show that $\pi \colon \tilde U_\alpha \to U$ is a diffeomorphism for each $\alpha$.
It is certainly a local diffeomorphism (because $\pi$ is). It is bijective because
its inverse can be constructed explicitly: it is the map sending each radial
geodesic starting at $p$ to its lift starting at $\tilde p_\alpha$. This completes the proof. □

Because of this theorem, a complete, simply-connected Riemannian
manifold with nonpositive sectional curvature is called a Cartan-Hadamard
manifold. An immediate consequence of the Cartan-Hadamard theorem is
that there are stringent topological restrictions on which manifolds can
carry metrics of nonpositive sectional curvature. For example, if $M$ is a
product of compact manifolds $M_1 \times M_2$ where either $M_1$ or $M_2$ is simply
connected (such as, for example, $S^1 \times S^2$), then any metric on $M$ must
have positive sectional curvature somewhere. With a little algebraic topology,
one can obtain more information: for example, any manifold whose
universal cover is contractible is aspherical, which means that the higher
homotopy groups $\pi_k(M)$ vanish for $k > 1$ (see [Whi78]), so many manifolds
cannot admit metrics of nonpositive curvature.


Manifolds of Positive Curvature 

Next we consider manifolds with positive sectional curvature. Our comparison
theorems do not tell us anything about manifolds whose curvature is
bounded below instead of above. Nevertheless, clever analysis of the index
form can still lead to significant conclusions, as the proof of the following
theorem shows. We need one definition: the diameter of a Riemannian
manifold is

$$\operatorname{diam}(M) := \sup\{d(p,q) : p, q \in M\}.$$

Note that the diameter of the round sphere of radius $R$ is $\pi R$ (not $2R$), since
the Riemannian distance between antipodal points is $\pi R$ (Figure 11.4).

FIGURE 11.4. The diameter of $S^n_R$ is $\pi R$.

Theorem 11.7. (Bonnet’s Theorem) Let $M$ be a complete, connected
Riemannian manifold all of whose sectional curvatures are bounded below
by a positive constant $1/R^2$. Then $M$ is compact, with a finite fundamental
group, and with diameter less than or equal to $\pi R$.

Proof. The first step is to show that the diameter of $M$ is no greater than
$\pi R$. Suppose the contrary: then there are points $p, q \in M$, and (by the
Hopf-Rinow theorem) a minimizing unit speed geodesic segment $\gamma$ from $p$
to $q$ of length $L > \pi R$. Since $\gamma$ is minimizing, its index form is nonnegative.
We will derive a contradiction by constructing a proper normal vector field
$V$ along $\gamma$ such that $I(V,V) < 0$.

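Let $E$ be any parallel normal unit vector field along $\gamma$, and let

$$V(t) = \Big(\sin\frac{\pi t}{L}\Big) E(t).$$

Observe that $V$ vanishes at $t = 0$ and $t = L$, so $V$ is a proper normal vector
field along $\gamma$. (Note the similarity between $V$ and the formulas (10.5), (10.6)
for Jacobi fields on the sphere of radius $L/\pi$.) By direct computation,

$$D_t V(t) = \frac{\pi}{L}\Big(\cos\frac{\pi t}{L}\Big) E(t),$$

$$D_t^2 V(t) = -\frac{\pi^2}{L^2}\Big(\sin\frac{\pi t}{L}\Big) E(t),$$

and so

$$I(V,V) = -\int_0^L \big\langle D_t^2 V + R(V,\dot\gamma)\dot\gamma,\, V \big\rangle\, dt
        = \int_0^L \Big(\sin^2\frac{\pi t}{L}\Big)\Big(\frac{\pi^2}{L^2} - Rm(E,\dot\gamma,\dot\gamma,E)\Big)\, dt.$$

Since $E$ and $\dot\gamma$ are orthonormal, $Rm(E,\dot\gamma,\dot\gamma,E)$ is equal to the sectional
curvature of the plane they span, and so our estimate on sectional curvature
gives

$$I(V,V) \le \int_0^L \Big(\sin^2\frac{\pi t}{L}\Big)\Big(\frac{\pi^2}{L^2} - \frac{1}{R^2}\Big)\, dt < 0.$$

Therefore our geodesic of length $L > \pi R$ cannot be minimizing, so the
diameter of $M$ is at most $\pi R$.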
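
As a sanity check on the last estimate, the integral can be evaluated in closed form. The following worked computation is an illustrative sketch added here, not part of the original argument; it uses only the elementary identity for the integral of $\sin^2$ over a half period, together with the symbols $L$ and $R$ from the proof.

```latex
\begin{align*}
\int_0^L \sin^2\frac{\pi t}{L}\,dt &= \frac{L}{2}, \\
I(V,V) \;\le\; \Big(\frac{\pi^2}{L^2}-\frac{1}{R^2}\Big)\int_0^L \sin^2\frac{\pi t}{L}\,dt
  &= \Big(\frac{\pi^2}{L^2}-\frac{1}{R^2}\Big)\frac{L}{2}
   = \frac{\pi^2R^2-L^2}{2LR^2}.
\end{align*}
```

The right-hand side is negative exactly when $L > \pi R$, which is the assumption being contradicted; at $L = \pi R$ the bound is zero, consistent with the round sphere of radius $R$ attaining the diameter $\pi R$.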

To show that $M$ is compact, we just choose a basepoint $p$ and note
that every point in $M$ can be connected to $p$ by a geodesic segment of
length at most $\pi R$. Therefore, $\exp_p \colon \bar B_{\pi R}(0) \to M$ is surjective, so $M$ is
the continuous image of a compact set.

Finally, let $\pi \colon \tilde M \to M$ denote the universal covering space of $M$, with
the metric $\tilde g := \pi^* g$. $\tilde M$ is complete by the result of Problem 6-11, and
$\tilde g$ also has sectional curvatures bounded below by $1/R^2$, so $\tilde M$ is compact
by the argument above. By the theory of covering spaces (see [Mas67,
Corollary V.7.5]), there is a one-to-one correspondence between $\pi_1(M)$ and
the inverse image $\pi^{-1}(p)$ of any point $p \in M$. If $\pi_1(M)$ were infinite,
therefore, $\pi^{-1}(p)$ would be an infinite discrete set in $\tilde M$, contradicting the
compactness of $\tilde M$. Thus $\pi_1(M)$ is finite. □


It is rather surprising that the conclusions of Bonnet’s theorem hold 
with the much weaker assumption of strictly positive Ricci tensor, as the 
following theorem shows. 

Theorem 11.8. (Myers’s Theorem) Suppose $M$ is a complete, connected
Riemannian $n$-manifold whose Ricci tensor satisfies the following
inequality for all $V \in TM$:

$$Rc(V,V) \ge \frac{n-1}{R^2}\,|V|^2.$$

Then $M$ is compact, with a finite fundamental group, and diameter at most
$\pi R$.


Proof. As in the proof of Bonnet’s theorem, it suffices to prove the diameter
estimate. As before, let $\gamma$ be a minimizing unit speed geodesic segment of
length $L > \pi R$. Let $(E_1, \dots, E_n)$ be a parallel orthonormal frame along $\gamma$
such that $E_n = \dot\gamma$, and for each $i = 1, \dots, n-1$ let $V_i$ be the proper normal
vector field

$$V_i(t) = \Big(\sin\frac{\pi t}{L}\Big) E_i(t).$$

By the same computation as before,

$$I(V_i,V_i) = \int_0^L \Big(\sin^2\frac{\pi t}{L}\Big)\Big(\frac{\pi^2}{L^2} - Rm(E_i,\dot\gamma,\dot\gamma,E_i)\Big)\, dt. \tag{11.1}$$

In this case, we cannot conclude that each of these terms is negative.
However, because $\{E_i\}$ is an orthonormal frame, the Ricci tensor at points
along $\gamma$ is given by

$$Rc(\dot\gamma,\dot\gamma) = \sum_{i=1}^{n} Rm(E_i,\dot\gamma,\dot\gamma,E_i) = \sum_{i=1}^{n-1} Rm(E_i,\dot\gamma,\dot\gamma,E_i)$$

(because $Rm(E_n,\dot\gamma,\dot\gamma,E_n) = Rm(\dot\gamma,\dot\gamma,\dot\gamma,\dot\gamma) = 0$). Therefore, summing
(11.1) over $i$ gives

$$\sum_{i=1}^{n-1} I(V_i,V_i) = \int_0^L \Big(\sin^2\frac{\pi t}{L}\Big)\Big((n-1)\frac{\pi^2}{L^2} - Rc(\dot\gamma,\dot\gamma)\Big)\, dt
  \le \int_0^L \Big(\sin^2\frac{\pi t}{L}\Big)\Big((n-1)\frac{\pi^2}{L^2} - \frac{n-1}{R^2}\Big)\, dt < 0.$$

This means at least one of the terms $I(V_i,V_i)$ must be negative, and again
we have a contradiction to $\gamma$ being minimizing. □

One of the most useful applications of Myers’s theorem is to Einstein
metrics. If $g$ is a complete Einstein metric with positive scalar curvature,
then $Rc = \frac{1}{n} S g$ satisfies the hypotheses of the theorem; it follows that
complete, noncompact Einstein manifolds must have nonpositive scalar
curvature. On the other hand, it is possible for complete, noncompact manifolds
to have strictly positive Ricci or even sectional curvature, as long as it gets
arbitrarily close to zero, as the following example shows.

Exercise 11.1. Let $M \subset \mathbb{R}^{n+1}$ be the paraboloid $\{(x^1, \dots, x^n, y) : y = |x|^2\}$
with the induced metric (see Problem 8-2). Show that $M$ has strictly
positive sectional curvature everywhere.
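
Two short computations may make the Einstein remark and the paraboloid example concrete. Both are illustrative sketches rather than part of the original text. The first assumes the scalar curvature $S$ of the Einstein metric is a positive constant and reads off the radius $R$ that Myers's theorem requires. The second treats only the surface case $n = 2$ of the paraboloid, and assumes the standard formula for the Gaussian curvature of a graph $y = f(x^1, x^2)$ in $\mathbb{R}^3$; the notation $f_{ij}$ for second partial derivatives is introduced here for the illustration.

```latex
% (1) Einstein metric on an n-manifold, assuming S is a positive constant:
\[
Rc(V,V) = \frac{S}{n}\,|V|^2 = \frac{n-1}{R^2}\,|V|^2
\quad\text{with}\quad R^2 = \frac{n(n-1)}{S},
\qquad\text{so}\qquad
\operatorname{diam}(M) \le \pi\sqrt{\frac{n(n-1)}{S}} .
\]
% Check: the unit sphere S^n has S = n(n-1), which gives diam <= pi.

% (2) Paraboloid y = f(x) = |x|^2 in R^3 (surface case n = 2 only):
\[
K = \frac{f_{11}f_{22} - f_{12}^2}{\bigl(1 + |\nabla f|^2\bigr)^2}
  = \frac{4}{\bigl(1 + 4|x|^2\bigr)^2} > 0,
\qquad K \to 0 \ \text{as}\ |x| \to \infty .
\]
```

In the second computation the curvature is positive everywhere but has no positive lower bound, which is why neither Bonnet's theorem nor Myers's theorem forces the paraboloid to be compact.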


There is much more that can be said in the case of positive sectional
curvature, using more elaborate versions of the methods we have developed
here. One question you might already have asked yourself is whether there
are analogues of our comparison theorems when the curvature is bounded
below instead of above. Our proof of the Jacobi field comparison theorem
definitely does not work if a lower bound on curvature is substituted for
the upper bound, because the step involving the Schwarz inequality is not
reversible (except in dimension 2; see Problem 11-1).

FIGURE 11.5. Setup for the Rauch comparison theorem.

Nonetheless, the analogues of all three of these results are true with
curvature bounded below, but the proofs are considerably more involved.
The key fact is the following very general comparison theorem. The proof
would take us too far afield, so we state it without proof.

Theorem 11.9. (Rauch Comparison Theorem) Let $M$ and $\tilde M$ be Riemannian
manifolds, let $\gamma \colon [0,T] \to M$ and $\tilde\gamma \colon [0,T] \to \tilde M$ be unit speed
geodesic segments such that $\tilde\gamma(0)$ has no conjugate points along $\tilde\gamma$, and let
$J$, $\tilde J$ be normal Jacobi fields along $\gamma$ and $\tilde\gamma$ such that $J(0) = \tilde J(0) = 0$
and $|D_t J(0)| = |\tilde D_t \tilde J(0)|$ (Figure 11.5). Suppose that the sectional curvatures
of $M$ and $\tilde M$ satisfy $K(\Pi) \le \tilde K(\tilde\Pi)$ whenever $\Pi \subset T_{\gamma(t)}M$ is a 2-plane
containing $\dot\gamma(t)$ and $\tilde\Pi \subset T_{\tilde\gamma(t)}\tilde M$ is a 2-plane containing $\dot{\tilde\gamma}(t)$. Then
$|J(t)| \ge |\tilde J(t)|$ for all $t \in [0,T]$.

You can find proofs in [dC92], [CE75], and [Spi79, volume 4]. Letting
$\tilde M$ be one of our constant curvature model spaces, we recover the Jacobi
field comparison theorem above. On the other hand, if instead we take $M$
to have constant curvature, we get the same result with the inequalities
reversed.
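
To make the two specializations concrete, here is a hedged worked instance with the sphere of radius $R$ as the model space. It is a sketch added for illustration: the label $J_{\mathrm{model}}$ is introduced here and is not notation from the text, and the formula for the model Jacobi field is the standard one for normal Jacobi fields vanishing at $t = 0$ in constant curvature $1/R^2$.

```latex
% Normal Jacobi field vanishing at t = 0 on the sphere of radius R, for 0 <= t <= pi R:
\[
|J_{\mathrm{model}}(t)| = R\,\sin\!\Big(\frac{t}{R}\Big)\,|D_tJ_{\mathrm{model}}(0)| .
\]
% Case 1 (model plays the role of \tilde M): all sectional curvatures of M
% satisfy K <= 1/R^2, and T < pi R so the model geodesic has no conjugate points.
\[
|J(t)| \;\ge\; R\,\sin\!\Big(\frac{t}{R}\Big)\,|D_tJ(0)| , \qquad 0 \le t \le T .
\]
% Case 2 (model plays the role of M): all sectional curvatures satisfy K >= 1/R^2,
% and gamma(0) has no conjugate points along gamma on [0,T].
\[
|J(t)| \;\le\; R\,\sin\!\Big(\frac{t}{R}\Big)\,|D_tJ(0)| , \qquad 0 \le t \le T .
\]
```

Case 1 is the Jacobi field comparison theorem for the upper bound $1/R^2$; Case 2 is the reversed inequality, which the elementary proof does not give but the Rauch theorem does.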

The most successful applications of the Rauch comparison theorem have
been to prove “pinching theorems.” A manifold is said to be $\delta$-pinched if
all sectional curvatures satisfy

$$\frac{\delta}{R^2} \le K(\Pi) \le \frac{1}{R^2}$$

for some $\delta, R > 0$, and strictly $\delta$-pinched if the first inequality is strict. The
following celebrated theorem was originally proved by Marcel Berger and
Walter Klingenberg in the early 1960s.

Theorem 11.10. (The Sphere Theorem) Suppose $M$ is a complete,
simply-connected, Riemannian $n$-manifold that is strictly $\frac{1}{4}$-pinched. Then
$M$ is homeomorphic to $S^n$.

The proof, which can be found in [CE75] or [dC92], is an elaborate
application of the Rauch comparison theorem together with the Morse index
theorem mentioned in Chapter 10. This result is sharp, at least in even
dimensions, because the Fubini-Study metrics on complex projective spaces
are $\frac{1}{4}$-pinched (Problem 8-12).

Techniques from partial differential equations can lead to even
stronger conclusions in some cases. For instance, in 1982, Richard Hamilton
[Ham82] proved the following very striking result on 3-manifolds.

Theorem 11.11. (Hamilton) Suppose $M$ is a simply-connected compact
Riemannian 3-manifold with strictly positive Ricci curvature. Then $M$ is
diffeomorphic to $S^3$.


Manifolds of Constant Curvature 

Our last application of Jacobi field techniques is to give a global
characterization of complete manifolds of constant sectional curvature.

Theorem 11.12. (Uniqueness of Constant Curvature Metrics) Let
$M$ be a complete, simply-connected Riemannian $n$-manifold with constant
sectional curvature $C$. Then $M$ is isometric to one of the model spaces $\mathbb{R}^n$,
$S^n_R$, or $H^n_R$.

Proof. It is easiest to handle the cases of positive and nonpositive sectional
curvature separately. First suppose $C \le 0$. Then the Cartan-Hadamard
theorem says that for any $p \in M$, $\exp_p \colon T_pM \to M$ is a covering map.
Since $M$ is simply connected, it is a diffeomorphism. The pulled-back metric
$\tilde g := \exp_p^* g$, therefore, is a globally defined metric on $T_pM$ with constant
sectional curvature $C$, and $\exp_p \colon (T_pM, \tilde g) \to (M, g)$ is a global isometry.
Moreover, since Euclidean coordinates for $T_pM$ are normal coordinates for
$\tilde g$, it must be given by one of the cases of formula (10.8); these in turn are
globally isometric to $\mathbb{R}^n$ if $C = 0$ and $H^n_R$ if $C = -1/R^2$.

In case $C = 1/R^2 > 0$, we have to argue a little differently. Let $\{N, -N\}$
be the north and south poles in $S^n_R$, and observe that $\exp_N$ is a diffeomorphism
from $B_{\pi R}(0) \subset T_N S^n_R$ to $S^n_R - \{-N\}$. On the other hand,
choosing any point $p \in M$, the conjugate point comparison theorem shows
that $p$ has no conjugate points closer than $\pi R$, so $\exp_p$ is at least a local
diffeomorphism on $B_{\pi R}(0) \subset T_pM$. If we choose any linear isometry
$\varphi \colon T_N S^n_R \to T_pM$ (Figure 11.6), then $(\exp_p \circ \varphi)^* g$ and $\exp_N^* \mathring{g}_R$ are both
metrics of constant curvature $1/R^2$ on $B_{\pi R}(0) \subset T_N S^n_R$, and Euclidean
coordinates on $T_N S^n_R$ are normal coordinates for both (since the radial line
segments are geodesics). Therefore, Proposition 10.9 shows that they are
equal, so the map $\Phi \colon S^n_R - \{-N\} \to M$ given by $\Phi = \exp_p \circ \varphi \circ \exp_N^{-1}$ is a
local isometry.

FIGURE 11.6. Constructing an isometry when $C > 0$.

Now choose any point $Q \in S^n_R$ other than $N$ or $-N$, and let $q = \Phi(Q) \in M$.
Using the isometry $\psi = \Phi_* \colon T_Q S^n_R \to T_qM$, we can construct a similar
map $\Psi = \exp_q \circ \psi \circ \exp_Q^{-1} \colon S^n_R - \{-Q\} \to M$, and the same argument
shows that $\Psi$ is a local isometry. Because $\Phi(Q) = \Psi(Q)$ and $\Phi_* = \Psi_*$ at $Q$
by construction, $\Phi$ and $\Psi$ must agree where they overlap by Problem 5-7.
Putting them together, therefore, we get a globally defined local isometry
$F \colon S^n_R \to M$. After noting that $M$ is compact by Bonnet’s theorem, we
complete the proof by appealing to Exercise 11.2 below. □

Exercise 11.2. Show that any local diffeomorphism between compact, 

connected manifolds is a covering map. 

Theorem 11.12 is a special case of a rather more complicated result, 
the Cartan-Ambrose-Hicks theorem, which says roughly that two simply- 
connected manifolds, all of whose sectional curvatures at corresponding 
points are equal to each other, must be isometric. The main idea of the proof 
is very similar to what we have done here; the trick is in making precise 
sense of the notion of “corresponding points,” and of what it means for 
nonconstant sectional curvatures to be equal at different points of different 
manifolds. See [Wol84] or [CE75] for the complete statement and proof. 

Combining our classification of simply-connected manifolds of constant
curvature with the characterization of their isometry groups given in
Problem 5-8, we obtain finally the following description of all complete manifolds
of constant curvature.

Corollary 11.13. (Classification of Constant Curvature Metrics)
Suppose $M$ is a complete, connected Riemannian manifold with constant
sectional curvature. Then $M$ is isometric to $\tilde M/\Gamma$, where $\tilde M$ is one of the
constant curvature model spaces $\mathbb{R}^n$, $S^n_R$, or $H^n_R$, and $\Gamma$ is a discrete
subgroup of $\mathcal{I}(\tilde M)$, isomorphic to $\pi_1(M)$, and acting freely and properly
discontinuously on $\tilde M$.

Proof. If $\pi \colon \tilde M \to M$ is the universal covering space of $M$ with the lifted
metric $\tilde g = \pi^* g$, the preceding theorem shows that $(\tilde M, \tilde g)$ is isometric to one
of the model spaces. From covering space theory [Sie92, Mas67] it follows
that the group $\Gamma$ of covering transformations is isomorphic to $\pi_1(M)$ and
acts freely and properly discontinuously on $\tilde M$, and $M$ is diffeomorphic to
the quotient $\tilde M/\Gamma$. Moreover, if $\varphi$ is any covering transformation, $\pi \circ \varphi = \pi$,
and so $\varphi^* \tilde g = \varphi^* \pi^* g = \pi^* g = \tilde g$, so $\Gamma$ acts by isometries. Finally, suppose
$\{\varphi_i\} \subset \Gamma$ is an infinite set with an accumulation point in $\mathcal{I}(\tilde M)$. Since the
action of $\Gamma$ is fixed-point free, for any point $\tilde p \in \tilde M$ the set $\{\varphi_i(\tilde p)\}$ is infinite,
and by continuity of the action it has an accumulation point in $\tilde M$. But
this is impossible, since the points $\{\varphi_i(\tilde p)\}$ all project to the same point in
$M$, and so form a discrete set. Thus $\Gamma$ is discrete in $\mathcal{I}(\tilde M)$. □

A complete, connected Riemannian manifold with constant sectional
curvature is called a space form. This result essentially reduces the
classification of space forms to group theory. Nevertheless, the group-theoretic
problem is still far from easy.
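
Two familiar quotients illustrate the form $\tilde M/\Gamma$ described in the corollary. They are standard examples added here as a sketch, not drawn from the text; in each case $\Gamma$ acts freely and properly discontinuously by isometries of the model space.

```latex
% Flat torus: Euclidean space form with fundamental group Z^n.
\[
T^n = \mathbb{R}^n/\Gamma, \qquad
\Gamma = \mathbb{Z}^n \ \text{acting by translations } x \mapsto x + k, \qquad
\pi_1(T^n) \cong \mathbb{Z}^n .
\]
% Real projective space: spherical space form with fundamental group Z/2 (n >= 2).
\[
\mathbb{RP}^n = S^n/\Gamma, \qquad
\Gamma = \{\pm\mathrm{Id}\} \subset O(n+1), \qquad
\pi_1(\mathbb{RP}^n) \cong \mathbb{Z}/2 \quad (n \ge 2) .
\]
```

In the first example $\Gamma$ is infinite and the curvature is zero; in the second $\Gamma$ is finite, as Bonnet's theorem requires for any spherical space form.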

The spherical space forms were classified in 1972 by Joseph Wolf [Wol84];
the proof is intimately connected with the representation theory of finite
groups. Although the only 2-dimensional ones are the sphere and the
projective plane, already in dimension 3 there are many interesting examples.
Some notable ones are the lens spaces obtained as quotients of $S^3 \subset \mathbb{C}^2$
by cyclic groups rotating the two complex coordinates through different
angles; and the quotients of SO(3) (which is diffeomorphic to $\mathbb{RP}^3$ and is
therefore already a quotient of $S^3$) by the dihedral groups, the symmetry
groups of regular 3-dimensional polyhedra.

The complete classification of Euclidean space forms is known only in low
dimensions. For example, there are 10 classes of nondiffeomorphic compact
Euclidean space forms of dimension 3, and 75 classes in dimension 4. The
fundamental groups of compact Euclidean space forms are examples of
crystallographic groups, which are discrete groups of Euclidean isometries
with compact quotients, and which have been studied extensively by
physicists as well as geometers. (A quotient of $\mathbb{R}^n$ by a crystallographic group
is a space form provided it is a manifold, which is true whenever the
crystallographic group has no elements of finite order.) It is known in general
that Euclidean space forms are quotients of flat tori, but the classification
in higher dimensions is still elusive. See [Wol84] for a complete survey of
the state of the art as of 1972.

Finally, the study of hyperbolic space forms is a vast and rich subject,
the surface of which has barely been scratched.

Problems

11-1. Show that when the dimension of $M$ is 2, the argument of Theorem
11.2 can be adapted to give an explicit upper bound for $|J(t)|$ when
the Gaussian curvature is bounded below.

11-2. Adapt the argument of Theorem 11.1 to prove the following
generalizations of the Sturm comparison theorem, also due to Sturm.

(a) Suppose $a, b$ are continuous functions on an open interval $I$ with
$a \ge b$, and $u, v$ are nontrivial solutions to

$$\ddot u(t) + a(t)u(t) = 0,$$
$$\ddot v(t) + b(t)v(t) = 0$$

on $I$. Then between any two zeros of $v$ there must be at least
one zero of $u$, unless $a \equiv b$ and $u$ and $v$ are constant multiples
of each other.

(b) (Sturm Separation Theorem) Suppose $a$ is continuous on an
interval $I$, and $u_1, u_2$ are two linearly independent solutions on
$I$ to

$$\ddot u(t) + a(t)u(t) = 0.$$

Show that the zeros of $u_1$ and $u_2$ are strictly alternating.

11-3. Suppose $M$ is a Cartan-Hadamard manifold whose sectional
curvature is bounded above by a negative constant $C$. Show that the
volume of any geodesic ball in $M$ is at least as large as that of the ball
with the same radius in hyperbolic space of curvature $C$.

acceleration

Euclidean, 48 

of a curve on a manifold, 58 
of a plane curve, 3 
tangential, 48 

adapted orthonormal frame, 43, 
133 

adjoint representation, 46 
admissible 
curve, 92 
family, 96 

affine connection, 51 
aims at a point, 109 
algebraic Bianchi identity, 122 
alternating tensors, 14 
ambient 

manifold, 132 
tangent bundle, 132 
Ambrose 

Cartan-Ambrose-Hicks 
theorem, 205 

angle 

between vectors, 23 
tangent, 156, 157 
angle-sum theorem, 2, 162, 166 


arc length 

function, 93 
parametrization, 93 
aspherical, 199 
automorphism, inner, 46 

♭ (flat), 27-29

$B^n_R$ (Poincaré ball), 38
$B_R(p)$ (geodesic ball), 106
$\bar B_R(p)$ (closed geodesic ball), 106
ball, geodesic, 76, 106 
ball, Poincare, 38 
base of a vector bundle, 16 
Berger, Marcel, 203 

Berger metrics, 151 
bi-invariant metric, 46, 89 
curvature of, 129, 153 
existence of, 46 
exponential map, 89 
Bianchi identity 
algebraic, 122 
contracted, 124 
differential, 123 
first, 122 
second, 123 
Bonnet

Bonnet’s theorem, 9, 200 
Gauss-Bonnet theorem, 167 
boundary problem, two-point, 
184 

bundle 

cotangent, 17 
normal, 17, 133 
of k-forms, 20
of tensors, 19 
tangent, 17 
vector, 16 

calculus of variations, 96 
Caratheodory metric, 32 
Carnot-Caratheodory metric, 31 
Cartan’s first structure equation, 
64 

Cartan’s second structure 
equation, 128 

Cartan-Ambrose-Hicks theorem, 
205 

Cartan-Hadamard manifold, 199 
Cartan-Hadamard theorem, 9, 
196 

catenoid, 150 
Cayley transform, 40 
generalized, 40 

Chern-Gauss-Bonnet theorem, 
170 

Christoffel symbols, 51 

formula in coordinates, 70 
circle classification theorem, 2 
circles, 2 

circumference theorem, 2, 162, 
166 

classification theorem, 2 
circle, 2 

constant curvature metrics, 
9, 206 

plane curve, 4 
closed curve, 156 
closed geodesic ball, 76 
coframe, 20 


commuting vector fields, normal 
form, 121 

comparison theorem 

conjugate point, 195 
Jacobi field, 194 
metric, 196 
Rauch, 203, 204 
Sturm, 194 

compatibility with a metric, 67 
complete, geodesically, 108 
complex projective space, 46 
conformal metrics, 35 
conformally equivalent, 35 
conformally flat, locally, 37 
hyperbolic space, 41 
sphere, 37 
congruent, 2 
conjugate, 182 
conjugate locus, 190 
conjugate point, 182 

comparison theorem, 195 
geodesic not minimizing 
past, 188 

singularity of expp, 182 
connection, 49 

1-forms, 64, 165 
Euclidean, 52 
existence of, 52 
in a vector bundle, 49 
in components, 51 
linear, 51 

on tensor bundles, 53-54 
Riemannian, 68 

formula in arbitrary 
frame, 69 

formula in coordinates, 70 
naturality, 70 
tangential, 66 
connection 1-forms, 166 
constant Gaussian curvature, 7 
constant sectional curvature, 148 
classification, 9, 206 
formula for curvature 
tensor, 148 

formula for metric, 179 
local uniqueness, 181
model spaces, 9 
uniqueness, 204 
constant speed curve, 70 
contracted Bianchi identity, 124 
contraction, 13 
contravariant tensor, 12 
control theory, 32 
converge to infinity, 113 
convex 

geodesic polygon, 171 
set, 112 
coordinates, 14 

have upper indices, 15 
local, 14 
normal, 77 

Riemannian normal, 77 
slice, 15 

standard, on R”, 25 
standard, on tangent 
bundle, 19 

cosmological constant, 126 
cotangent bundle, 17 
covariant derivative, 50 
along a curve, 57-58 
of tensor field, 53-54 
total, 54 

covariant Hessian, 54, 63 
covariant tensor, 12 
covectors, 11 
covering 

map, 197 
metric, 27 
Riemannian, 27 
transformation, 27 
critical point, 101, 126, 142 
crystallographic groups, 206 
curvature, 3-10, 117 
2-forms, 128 

constant sectional, 9, 148, 
179-181, 204, 206 
constant, formula for, 148 
endomorphism, 117, 128 
Gaussian, 6-7, 142-145 
geodesic, 137 


in coordinates, 128 
mean, 142 

of a curve in a manifold, 137 
of a plane curve, 3 
principal, 4, 141 
Ricci, 124 
Riemann, 117, 118 
scalar, 124 
sectional, 9, 146 
signed, 4, 163 
tensor, 118 
curve, 55 

admissible, 92 
in a manifold, 55 
plane, 3 
segment, 55 

curved polygon, 157, 162 

cusp, 157 

cut 

locus, 190 
point, 190 

cylinder, principal curvatures, 5 

djdr (unit radial vector field), 

77 

djdx^ (coordinate vector field), 
15 

di (coordinate vector field), 15 
(covariant Hessian), 54 
VF (total covariant derivative), 
54 

(tangential connection), 66, 
135 

Vxh" (covariant derivative), 
49-50 

A (Laplacian), 44 

d{p,q) (Riemannian distance), 

94 

Dg (covariant derivative along 
transverse curves), 97 
Dt (covariant derivative along a 
curve), 57 

deck transformation, 27 
defining function, 150 
diameter, 199 
difference tensor, 63
differential Bianchi identity, 123 
differential forms, 20 
dihedral groups, 206 
distance, Riemannian, 94 
divergence, 43 

in terms of covariant 
derivatives, 88 
operator, 43 
theorem, 43 

domain of the exponential map, 
72 

dual 

basis, 13 
coframe, 20 
space, 11 

dV (Riemannian volume 
element), 29 

dVg (Riemannian volume 
element), 29 

£ (domain of the exponential 
map), 72 

E{n) (Euclidean group), 44 
edges of a curved polygon, 157 
eigenfunction of the Laplacian, 
44 

eigenvalue of the Laplacian, 44 
Einstein 

field equation, 126 
general theory of relativity, 
31, 126 

metric, 125, 202 
special theory of relativity, 
31 

summation convention, 13 
embedded submanifold, 15 
embedding, 15 
isometric, 132 
End(E) (space of 

endomorphisms), 12 
endomorphism 

curvature, 117 
of a vector space, 12 
escape lemma, 60 


Euclidean 

acceleration, 48 
connection, 52 
geodesics, 81 
group, 44 
metric, 25, 33 
homogeneous and 
isotropic, 45 
triangle, 2 

Euler characteristic, 167, 170 
Euler-Lagrange equation, 101 
existence and uniqueness 
for linear ODEs, 60 
for ODEs, 58 
of geodesics, 58 
of Jacobi fields, 176 
exp (exponential map), 72 
expp (restricted exponential 
map), 72 

exponential map, 72 
domain of, 72 
naturality, 75 
of bi-invariant metric, 89 
extendible vector fields, 56 
extension 

of functions, 15 
of vector fields, 16, 132 
exterior k-form, 14
exterior angle, 157, 163 

family, admissible, 96 
fiber 

metric, 29 
of a submersion, 45 
of a vector bundle, 16 
Finsler metric, 32 
first Bianchi identity, 122 
first fundamental form, 134 
first structure equation, 64 
first variation, 99 
fixed-endpoint variation, 98 
flat

connection, 128 
locally conformally, 37 
Riemannian metric, 24, 119 
flat (♭), 27-29
flatness criterion, 117 
forms 

bundle of, 20 
differential, 20 
exterior, 14 
frame 

local, 20 
orthonormal, 24 
Fubini-Study metric, 46, 204 
curvature of, 152 
functional 

length, 96 
linear, 11 
fundamental form 
first, 134 
second, 134 
fundamental lemma of 

Riemannian geometry, 

68 

7 (velocity vector), 56 
7 (a)'") (one-sided velocity 
vectors), 92 

r(s,t) (admissible family), 96 
$\gamma_V$ (geodesic with initial velocity
V), 59

g (Euclidean metric), 25 
g (round metric), 33 
g^ (round metric of radius i?), 

33 

Gauss equation, 136 

for Euclidean hypersurfaces, 
140 

Gauss formula, 135 
along a curve, 138 
for Euclidean hypersurfaces, 
140 

Gauss lemma, 102 
Gauss map, 151 
Gauss’s Theorema Egregium, 6 , 
143 

Gauss-Bonnet 

Chern-Gauss-Bonnet
theorem, 170 


formula, 164 
theorem, 7, 167 
Gaussian curvature, 6 , 142 
constant, 7 

is isometry invariant, 143 
of abstract 2-manifold, 144 
of hyperbolic plane, 145 
of spheres, 142 
general relativity, 31, 126 
generalized Cayley transform, 40
generating curve, 87 
genus, 169 
geodesic 

ball, 76, 106 
closed, 76 
curvature, 137 
equation, 58 
polygon, 171 
sphere, 76, 106 
triangle, 171 
vector field, 74
geodesically complete, 108 
equivalent to metrically 
complete, 108 
geodesics, 8 , 58 

are constant speed, 70 
are locally minimizing, 106 
existence and uniqueness, 58 
maximal, 59 

on Euclidean space, 58, 81 
on hyperbolic spaces, 83 
on spheres, 82 
radial, 78, 105 
Riemannian, 70 
with respect to a 
connection, 58 
gradient, 28 

Gram-Schmidt algorithm, 24, 

30, 43, 143, 164 
graph coordinates, 150 
great circles, 82 
great hyperbolas, 84 
Green’s identities, 44 

H (mean curvature), 142 
h (scalar second fundamental
form), 139 

(hyperbolic space), 38-41 
Hr (hyperbolic metric), 38-41 
Hadamard 

Cartan-Hadamard theorem, 
196 

half-cylinder, principal 
curvatures, 5 
half-plane, upper, 7 
half-space, Poincare, 38 
harmonic function, 44 
Hausdorff, 14 
Hessian 

covariant, 54, 63 
of length functional, 187 
Hicks 

Cartan-Ambrose-Hicks 
theorem, 205 
Hilbert action, 126 
homogeneous and isotropic, 33 
homogeneous Riemannian 
manifold, 33 

homotopy groups, higher, 199 
Hopf, Heinz, 158 

Hopf-Rinow theorem, 108 
rotation angle theorem, 158 
Umlaufsatz, 158 
Hopf-Rinow theorem, 108 
horizontal index position, 13 
horizontal lift, 45 
horizontal space, 45 
horizontal vector field, 89 
hyperbolic 

metric, 38-41 
plane, 7 
space, 38-41 

stereographic projection, 38 
hyperboloid model, 38 
hypersurface, 139 

I{V,W) (index form), 187 
ix (interior multiplication), 43 
ideal triangle, 171 
identification 


Tl{V) = End(P), 12 
Ti+i{y) with multilinear 
maps, 12 

n (second fundamental form), 
134 

immersed submanifold, 15 
immersion, 15 

isometric, 132 
index 

form, 187 

of a geodesic segment, 189 
of pseudo-Riemannian 
metric, 30, 43 
position, 13 

raising and lowering, 28 
summation convention, 13 
upper and lower, 13 
upper, on coordinates, 15 
induced metric, 25 
inertia, Sylvester’s law of, 30 
inner automorphism, 46 
inner product, 23 

on tensor bundles, 29 
on vector bundle, 29 
integral 

of a function, 30 
with respect to arc length, 
93 

integration by parts, 43, 88 
interior angle, 2 
interior multiplication, 43 
intrinsic property, 5 
invariants, local, 115 
inward-pointing normal, 163 
isometric 

embedding, 132 
immersion, 132 
locally, 115 
manifolds, 24 
isometries 

of Euclidean space, 44, 88 
of hyperbolic spaces, 41-42, 
88 

of spheres, 33-34, 88 
isometry, 5, 24 
group, see isometry group
local, 115, 197 
metric, 112 
of M, 24 
Riemannian, 112 
isometry group, 24 

of Euclidean space, 44, 88 
of hyperbolic spaces, 41-42, 
88 

of spheres, 33-34, 88 
isotropic 

at a point, 33 
homogeneous and, 33 
isotropy subgroup, 33 

Jacobi equation, 175 
Jacobi field, 176 

comparison theorem, 194 
existence and uniqueness, 
176 

in normal coordinates, 178 
normal, 177 
on constant curvature 
manifolds, 179 
jumps in tangent angle, 157 

KN{t) (signed curvature), 163 
K (Gaussian curvature), 142 
Kazdan, Jerry, 169 
Klingenberg, Walter, 203 
Kobayashi metric, 32 

$\Lambda^k M$ (bundle of k-forms), 20
Lg{j) (length of curve), 92 
L{'-f) (length of curve), 92 
Laplacian, 44 
latitude circle, 87 
law of inertia, Sylvester’s, 30 
left-invariant metric, 46 
Christoffel symbols, 89 
length 

functional, 96 
of a curve, 92 
of tangent vector, 23 
lens spaces, 206 


Levi-Civita connection, 68 
Lie derivative, 63 
linear connection, 51 
linear functionals, 11 
linear ODEs, 60 
local coordinates, 14 
local frame, 20 

orthonormal, 24 
local invariants, 115 
local isometry, 88, 115, 197 
local parametrization, 25 
local trivialization, 16 
local uniqueness of constant 

curvature metrics, 181 
local-global theorems, 2 
locally conformally flat, 37
hyperbolic space, 41 
sphere, 37 

locally minimizing curve, 106 
Lorentz group, 41 
Lorentz metric, 30 
lowering an index, 28 

main curves, 96 
manifold, Riemannian, 1, 23 
maximal geodesic, 59 
mean curvature, 142 
meridian, 82, 87 
metric 

Berger, 151 

bi-invariant, 46, 89, 129, 153 
Caratheodory, 32 
Carnot-Caratheodory, 31 
comparison theorem, 196 
Einstein, 125, 202 
Euclidean, 25, 33, 45 
fiber, 29 
Finsler, 32 

Fubini-Study, 46, 152, 204 
hyperbolic, 38-41 
induced, 25 
isometry, 112 
Kobayashi, 32 
Lorentz, 30 
Minkowski, 31, 38 
on submanifold, 25
on tensor bundles, 29 
product, 26 

pseudo-Riemannian, 30, 43 
Riemannian, 1, 23 
round, 33 

semi-Riemannian, 30 
singular Riemannian, 31 
space, 94 

sub-Riemannian, 31 
minimal surface, 142 
minimizing curve, 96 

is a geodesic, 100, 107 
locally, 106 

Minkowski metric, 31, 38 
mixed tensor, 12 
model spaces, 9, 33 
Morse index theorem, 189, 204 
multilinear over 21 

multiplicity of conjugacy, 182 
Myers’s theorem, 201 

NM (normal bundle), 132 
Isf(M) (space of sections of 

normal bundle), 133 
Nash embedding theorem, 66 
naturality 

of the exponential map, 75 
of the Riemannian 
connection, 70 

nondegenerate 2-tensor, 30, 116 
nonvanishing vector fields, 115 
norm 

Finsler metric, 32 
of tangent vector, 23 
normal bundle, 17, 133 
normal coordinates, 

Riemannian, 77 
normal form for commuting 
vector fields, 121 
normal Jacobi field, 177 
normal neighborhood, 76 
normal neighborhood lemma, 76 
normal projection, 133 
normal space, 132 


normal vector field along a 
curve, 177 

(connection 1-forms), 64 
0{n, 1) (Lorentz group), 41 
0+(n, 1) (Lorentz group), 41 
0{n + 1) (orthogonal group), 33 
one-sided derivatives, 55 
one-sided velocity vectors, 92 
order of conjugacy, 182 
orientation, for curved polygon, 
157 

orthogonal, 24 
orthogonal group, 33 
orthonormal, 24 
frame, 24 

frame, adapted, 43, 133 
osculating circle, 3, 137 

Tr-*- (normal projection), 133 
TT^ (tangential projection), 133 
7*0*1 (parallel translation 
operator), 61 

pairing between V and V*, 1\ 
parallel 

translation, 60-62, 94 
vector field, 59, 87 
parametrization 

by arc length, 93 
of a surface, 25 
parametrized curve, 55 
partial derivative operators, 15 
partition of unity, 15, 23 
path-lifting property, 156, 197 
Pfaffian, 170 

piecewise regular curve, 92 
piecewise smooth vector field, 93 
pinching theorems, 203 
plane curve, 3 
plane curve classification 
theorem, 4 
plane section, 145 
Poincare 
ball, 38 
half-space, 38 
polygon

curved, 157, 162 
geodesic, 171 
positive definite, 23 
positively oriented curved 
polygon, 157, 163 

principal 

curvatures, 4, 141 
directions, 141 
product metric, 26 
product rule 

for connections, 50 
for divergence operator, 43 
for Euclidean connection, 67 
projection 

hyperbolic stereographic, 38 
normal, 133 
of a vector bundle, 16 
stereographic, 35 
tangential, 133 
projective space 
complex, 46 
real, 148 
proper 

variation, 98 

vector field along a curve, 98 
pseudo-Riemannian metric, 30 
pullback connection, 71 

R (curvature endomorphism), 

117 

R^n (Euclidean space), 25, 33
r(x) (radial distance function), 77

Radó, Tibor, 167

radial distance function, 77 

radial geodesics, 78 

are minimizing, 105 
radial vector field, unit, 77 
raising an index, 28 
rank of a tensor, 12 
Rauch comparison theorem, 203, 
204 

Rc (Ricci tensor), 124 
real projective space, 148 


regular curve, 92 
regular submanifold, 15 
relativity 

general, 31, 126 
special, 31 

reparametrization, 92 

of admissible curve, 93 
rescaling lemma, 73 
restricted exponential map, 72 
Ricci curvature, 124 
Ricci identity, 128 
Ricci tensor, 124 

geometric interpretation, 
147 

symmetry of, 124 
Riemann 

curvature endomorphism, 
117 

curvature tensor, 118 
Riemann, G. F. B., 32 
Riemannian 

connection, 68-71 
covering, 27 
distance, 94 
geodesics, 70 
isometry, 112 
manifold, 1, 23 
metric, 1, 23 
normal coordinates, 77 
submanifold, 132 
submersion, 45-46, 89 
volume element, 29 
right-invariant metric, 46 
rigid motion, 2, 44 
Rm (curvature tensor), 118 
robot arm, 32 

Rot(γ) (rotation angle), 156
rotation angle, 156 

of curved polygon, 158, 163 
rotation angle theorem, 158 
for curved polygon, 163 
round metric, 33 

# (sharp), 28-29 
S (scalar curvature), 124 
s (shape operator), 140
S^n (unit n-sphere), 33
S^n_R (n-sphere of radius R), 33
S_R(p) (geodesic sphere), 106
scalar curvature, 124 

geometric interpretation, 
148 

scalar second fundamental form, 

139 

geometric interpretation, 

140 

Schoen, Richard, 127 
secant angle function, 159 
second Bianchi identity, 123 
second countable, 14 
second fundamental form, 134 
geometric interpretation, 
138, 140 
scalar, 139-140 

second structure equation, 128 
second variation formula, 185 
section of a vector bundle, 19 
zero section, 19 
sectional curvature, 9, 146 
constant, 148 
of Euclidean space, 148 
of hyperbolic spaces, 148, 
151 

of spheres, 148 
sections, space of, 19 
segment, curve, 55 
semi-Riemannian metric, 30 
semicolon between indices, 55 
shape operator, 140 
sharp (#), 28-29 
sides of a curved polygon, 157 
sign conventions for curvature 
tensor, 118 
signed curvature, 4 

of curved polygon, 163 
simple curve, 156 
singular Riemannian metric, 31 
singularities of the exponential 
map, 182 


SL(2, R) (special linear group),
45

slice coordinates, 15 
smooth, 14 
space forms, 206-207 
special relativity, 31 
speed of a curve, 70 
sphere, 33 

geodesic, 76, 106 
homogeneous and isotropic, 
34 

principal curvatures of, 6 
sphere theorem, 203 
spherical coordinates, 82 
SSS theorem, 2 
standard coordinates 
on R^n, 25
tangent bundle, 19 
star-shaped, 72, 73 
stereographic projection, 35 
hyperbolic, 38 
is a conformal equivalence, 
36 

Stokes’s theorem, 157, 165 
stress-energy tensor, 126 
structure constants of Lie group, 
89 

structure equation 
first, 64 
second, 128 
Sturm 

comparison theorem, 194, 
208 

separation theorem, 208 
SU(2) (special unitary group),
151 

sub-Riemannian metric, 31 
subdivision of interval, 92 
submanifold, 15 
embedded, 15 
immersed, 15 
regular, 15 
Riemannian, 25, 132 
submersion, Riemannian, 45-46, 
89 
summation convention, 13
surface of revolution, 25, 87 
Gaussian curvature, 150 
surfaces in space, 4 
Sylvester’s law of inertia, 30 
symmetric 2-tensor, 23 
symmetric connection, 63, 68 
symmetric product, 24 
symmetries 

of Euclidean space, 44, 88 
of hyperbolic spaces, 41-42, 
88 

of spheres, 33-34, 88 
of the curvature tensor, 121 
symmetry lemma, 97 
symplectic forms, 116 

T (torsion tensor), 63, 68 
𝒯^1(M) (space of 1-forms), 20
𝒯(γ) (space of vector fields along
a curve), 56

T^k_l M (bundle of mixed tensors),
19

𝒯^k_l(M) (space of mixed tensor
fields), 20

𝒯^k(M) (space of covariant tensor
fields), 20

T^k(V) (space of covariant
k-tensors), 12

T^k_l(V) (space of mixed tensors),
12

T_l(V) (space of contravariant
l-tensors), 12
TM (tangent bundle), 17
𝒯(M) (space of vector fields), 19
TM̃|_M (ambient tangent
bundle), 132

𝒯(M̃|_M) (space of sections of
ambient tangent
bundle), 133

T*M (cotangent bundle), 17 
tangent angle function, 156, 157, 
163 

tangent bundle, 17 
tangent space, 15 


tangential 

acceleration, 48 
connection, 66, 135 
projection, 133 
vector field along a curve, 
177 

tensor 

bundle, 19 
contravariant, 12 
covariant, 12 
field, 20 

fields, space of, 20 
mixed, 12 
of type (k, l), 12
on a manifold, 19 
product, 12 

tensor characterization lemma, 

21 

Theorema Egregium, 6, 143 
torsion 

2-forms, 64 
tensor, 63, 68 

torus, n-dimensional, 25, 27 
total covariant derivative, 54 
components of, 55 
total curvature theorem, 4, 162, 
166 

total scalar curvature functional, 
126, 127 

total space of a vector bundle, 16 
totally awesome theorem, 6, 143 
totally geodesic, 139 
tr_g (trace with respect to g), 28
trace 

of a tensor, 13 
with respect to g, 28 
transformation law for Γ^k_{ij}, 63
transition function, 18 
translation, parallel, 60-62 
transverse curves, 96 
triangle 

Euclidean, 2 
geodesic, 171 
ideal, 171 

triangulation, 166, 171 
trivialization, local, 16
tubular neighborhood theorem, 
150 

two-point boundary problem, 

184 

U^n_R (Poincaré half-space), 38
Umlaufsatz, 158 
uniformization theorem, 7 
uniformly normal, 78 
uniqueness of constant curvature 
metrics, 181 

unit radial vector field, 77 
unit speed 
curve, 70 

parametrization, 93 
upper half-plane, 7, 45 
upper half-space, 38 
upper indices on coordinates, 15 

vacuum Einstein field equation, 
126 

variation 
field, 98 
first, 99 

fixed-endpoint, 98 
of a geodesic, 98 
proper, 98 
second, 185 
through geodesics, 174 
variational equation, 101 
variations, calculus of, 96 
vector bundle, 16 
section of, 19 
space of sections, 19 


zero section, 19 
vector field, 19 

along a curve, 56 
along an admissible family, 
96 

normal, along a curve, 177 
piecewise smooth, 93 
proper, 98 

tangential, along a curve, 
177 

vector fields 

commuting, 121 
space of, 19 

vector space, tensors on, 12 

velocity, 48, 56 

vertical index position, 13 

vertical space, 45 

vertical vector field, 89 

vertices of a curved polygon, 157 

volume, 30 

volume element, 29 

Warner, Frank, 169 
wedge product, 14 

alternative definition, 14 
Weingarten equation, 136 

for Euclidean hypersurfaces, 
140 

Wolf, Joseph, 206 

χ(M) (Euler characteristic), 167

Yamabe problem, 127 

zero section, 19 